arXiv:1507.01309v2 [cs.DS] 29 Aug 2015 


APPROXIMATING (UNWEIGHTED) TREE AUGMENTATION VIA 
LIFT-AND-PROJECT, PART II 

JOSEPH CHERIYAN AND ZHIHAN GAO 


Abstract. In Part II, we study the unweighted Tree Augmentation Problem (TAP) via the Lasserre 
(Sum of Squares) system. We prove that the integrality ratio of an SDP relaxation (the Lasserre 
tightening of an LP relaxation) is < | + e, where e > 0 can be any small constant. We obtain this 
result by designing a polynomial-time algorithm for TAP that achieves an approximation guarantee 
of ( I -I- e) relative to the SDP relaxation. The algorithm is combinatorial and does not solve the SDP 
relaxation, but our analysis relies on the SDP relaxation. We generalize the combinatorial analysis of 
integral solutions from the previous literature to fractional solutions by identifying some properties 
of fractional solutions of the Lasserre system via the decomposition result of Karlin, Mathieu and 
Nguyen (IPCO 2011). 


1. Introduction 

In the weighted Tree Augmentation problem we are given a connected, undirected multigraph G 
with non-negative costs on the edges, together with a spanning tree T = (U(G), Et) of G; the goal 
is to find a set of edges, F C E{G)—Et, of minimum cost such that (U, Et U F) is 2-edge connected. 
By a link we mean an element of E{G)—Et', thus, a link is an edge of G that can be used to augment 
T. We say that a link uw covers an edge e G Et (a tree-edge) if T -|- uw — e = (V, Et U {rttc}—{e}) 
is connected. We say that a set of links F covers the tree T if every edge of T is covered by at least 
one link of E; it can be seen that F covers T iff {V,Et U F) is 2-edge-connected. Thus, the goal is 
to compute a set of links of minimum cost that covers T. 

The weighted Tree Augmentation problem was first studied by Frederickson and Jaja in 1981 
[4]. They showed that the problem is NP-hard, and they presented a 2-approximation algorithm. 
Subsequently, it has been proved that even the unweighted Tree Augmentation problem is APX-hard, 
see [9, Section 4]; thus the problem has no PTAS assuming Py^NP. 

There have been some important advances on this problem for the corresponding unweighted 
(i.e., uniform weight) problems. Following [5], we use the abbreviation TAP for the unweighted Tree 
Augmentation problem. Nagamochi, see [6], presented the first algorithm for TAP that improved on 
the approximation guarantee of 2; the approximation guarantee is 1.875. Subsequently, Even, et 
ah, [5] built on the ideas and techniques initiated by Nagamochi and presented an elegant algorithm 
and analysis that achieves an approximation guarantee of 1.8. In a conference publication from 
2001, Even, et ah, reported a 1.5 approximation algorithm for TAP, see the extended abstract [3], 
and recently, Kortsarz & Nutov [10] presented the journal version of this result. 

We prove that the integrality ratio of an SDP relaxation (the Lasserre tightening of an LP re¬ 
laxation) is < I + e, where e > 0 can be any small constant. We obtain this result by designing 
a polynomial-time algorithm for TAP that achieves an approximation guarantee of (| -|- e) relative 
to the SDP relaxation. The algorithm is combinatorial and does not solve the SDP relaxation, but 
our analysis relies on the SDP relaxation. Observe that our integrality ratio (and approximation 
guarantee) is proved relative to a “weaker lower bound” than the optimum value (since the feasible 
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region of a relaxation is a superset of the convex full of integer solutions). Moreover, the approxi¬ 
mation guarantee of 1.5 follows as a corollary of our main result (but, we cannot eliminate the “-|-e” 
in our bound on the integrality ratio by this corollary). 

Linear programming relaxations for the weighted version of TAP have been studied for many 
years. (We use the standard abbreviation LP to mean a linear programming relaxation or a linear 
programming problem.) There is an obvious “covering” LP: we have a variable Xg for each edge 
e G E{G) — Et and we have a covering constraint for each edge of T. It is well known that the 
integrality ratio of this LP is < 2; this can be deduced from Jain’s result [7]. A lower bound of 
1.5 on the integrality ratio is known [1]; in fact, the construction for the lower bound uses uniform 
weights for the edges in E{G) — Et, hence, the lower bound applies for TAP. We formulate an LP 
that is a tightening of the obvious “covering” LP for TAP; see (LPq) in Section 3. 

The Lasserre system applies to an initial LP, and it derives a sequence of tightenings of the initial 
LP; these tightened relaxations are indexed by a number t = 0,1,... called the level, where the level 0 
tightening means the initial LP. A key “decomposition theorem” (see Theorem 4.1, [11, 8]) asserts 
that a feasible solution at level t can be written as a convex combination of feasible solutions at a 
lower level such that all of these lower-level solutions y are “locally integral.” Here, “locally integral” 
means that there is a specified subset J C E{G) — Et, such that the solution y takes only zero or one 
values on this subset (i.e., ye G {0,1}, Ve G J). A key point is that the difference in levels (between 
the level t of the given feasible solution and the level of the lower-level “locally integral” solutions) 
does not depend on the size of J, rather, it depends on the following “combinatorial parameter” 
determined by J. Suppose that there exists a constant k such that every feasible solution x of the 
initial LP has < k entries in J that have value one, i.e., |{e G J : Xg = 1}| < k for every feasible 
solution X of the initial LP. Then for any t > k, a feasible solution at level t can be written as a convex 
combination of “locally integral” feasible solutions at level {t — k). This property does not hold for 
other weaker Lift-and-Project systems such as the Lovasz-Schrijver system or the Sherali-Adams 
system. 

The analysis of our algorithm is based on a potential function. Our potential function is derived 
from the Lasserre tightening of the initial LP. 

Our algorithm is “combinatorial” and we do not need to solve the initial LP nor its Lasserre tight¬ 
ening to run the algorithm; but, the analysis of the algorithm relies on the Lasserre tightening. Our 
algorithm essentially follows the “algorithmic scheme” of [5, Section 3.4], see Section 7 for details. 
The algorithm is a greedy-type iterative algorithm that makes a leaves-to-root scan over the tree T 
and (incrementally) constructs a set of links F that covers T. The algorithm starts with F := 0, at 
each major step it adds one or more links to F (it never removes links from F), and at termination, 
it outputs a set of links F that covers T such that |P| < the potential function. The algorithm 
incurs a cost of one unit for each link added to F. The key to the analysis is to show that for each 
major step, the cost incurred (i.e., one plus the number of links added to F) is compensated by a 
part of the potential function. 

Informally speaking, our analysis in Section 8 asserts the following: 

if the naive algorithm gets “stuck” then the instance contains a small combinatorial 
obstruction, a so-called deficient tree, see Theorems 8.7, 8.8. 

This assertion is the key to this paper; it turns out that the algorithmic aspects as well as the 
analysis of the approximation guarantee are straightforward consequences. Our analysis in Section 
8 makes essential use of the Lasserre system and the decomposition theorem, see Figure 1. 

For ease of exposition and for the sake of motivation, this paper has several discussions that 
contain forward references. But, these forward references are not relevant for the correctness of our 
arguments; indeed, all discussions containing forward references could be deleted, and this will have 

2 



Decomposition theorem 
for the Lasserre system 



Potential function 
of Lemma 5.2 

j 



shows deficient trees 
under appropriate conditions 


Figure 1. Illustration of the role of the decomposition theorem for the Lasserre 
system (Theorem 4.1). Our analysis consists of three blocks of results; low-level, 
middle-level, and high-level; these are shown from left to right in the figure. Note 
that Lemmas 7.2, 8.2 are often used in the proofs of subsequent results. 

no effect on the validity of our proofs. Also, Part II repeats some definitions, figures, and discussions 
from Part I, but proofs from Part I are not repeated. 

An outline of Part II is as follows. Section 2 has definitions and notation; several items are new 
to Part II (and not needed in Part I). We adopt the notation and terms of Even, et ah, [5], where 
possible; this will aid readers familiar with that paper. Section 3 presents the initial LP; this is 
the same LP as in Part 1. Section 4 discusses the Lasserre tightening of the initial LP, and proves 
some basic properties and inequalities; most of these results are from Part I, but there are two new 
results. Section 5 derives our potential function, based on a solution y of the Lasserre tightening; 
this potential function differs significantly from the potential function of Part 1. Section 6 starts the 
presentation of the algorithm (and credits) by elaborating on two preprocessing steps. Section 7 
completes the discussion of the algorithm (and credits) by presenting the main loop of the algorithm. 
The most important component of this paper is Section 8; this section presents the analysis of the 
algorithm by first proving some low-level properties, then builds on this to prove some intermediate- 
level lemmas, and then proves the key theorem on deficient trees (Theorem 8.7); this section also 
presents and proves the last piece of the algorithm, namely, the handling of deficient trees. 

2. Preliminaries and notation 

This section presents definitions and notation. 

Standard notation including tree T, link set E. Let G = (F, E{G)) be a connected, undirected 
multigraph, and let T = {V,Et) be a spanning tree of G. We assume that \V\ > 2. By a tree-edge 
we mean an edge of T. Let E denote the edge-set E{G)—Et', we call E the link set and we call an 
element i £ E a, link; thus, a link is an edge of G that can be used to augment T. An instance of 
TAP consists of G and T. We assume that all instances of interest have feasible solutions, that is, 
we assume that (V, Ej^ U E) is 2-edge connected. The goal is to find a minimum-size subset F of E 
such that augmenting T by F results in a 2-edge connected multigraph, i.e., (F, Ft U F) is 2-edge 
connected. 

For two nodes u,v G V, we use Pu,v = Pv,u to denote the unique path of the tree T between u 
and V. 

For a node u G F, we denote the number of tree-edges incident to v by deg 7 i(i;). For any node 
T G F, we use 5e{v) to denote the set of links incident to v. 
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For any U OV, we denote the set of links with both ends in U by E{U), and for any two subsets 
U,W of V, we denote the set of links with one end in U and the other end in W by E(U,W); 
thus, E{U,W) := {uw & E : u G U, w G W}. We use similar notation for some subsets of 
E] for example, E^^^ denotes a particular subset of E (defined below), and E’^^^{U,W) denotes 
{uw G E’-^s : uGU, wGW}. 

We say that a link uw covers a tree-edge e if Pu,w 9 e. Similarly, we say that a subtree of T is 
covered by a set of links J C E if each tree-edge of the subtree is covered by some link of J. For any 
tree-edge e G Ex, we use SEie) to denote the set of links that cover e, thus, ^^(e) = {uw G E : e G 
Pu,W } • 

Shadows and the shadow-closed property. For two links uivi and U 2 V 2 , if Pui,vi — Eu 2 ,v 2 i 
then we call uivi a shadow (or, sublink) of U 2 U 2 . 

For any link uv G E, if all sublinks of uv also exist in E, then we call E shadow-closed. Clearly, 
if E is not shadow-closed, then we can make it shadow-closed by adding all sublinks of each of the 
original links. It can be seen that this preserves the optimal value of any instance of TAP. 

Following Even, at al., [5], we make the next assumption (see Assumption 2.2 of [5]). 
Assumption: E is shadow-closed. 


Root, ancestor, descendant and rooted subtrees. One of the nodes r of T is designated as 
the root; thus, we have a rooted tree (T, r). 

Let u be a node of T. If a node w belongs to the path Pv^, then w is called an ancestor of v, and 

V is called a descendant of w. If a descendant rc of u is adjacent to v (thus, w ^ v), then w is called 

a child of v, and v is called a parent of w. Clearly, every node in V—{r} has a unique parent. If a 

node V has no child, then we call v a leaf] if v has no child, then deg'p(u) = 1. Note that r has at 

least one child (since \V\ > 2), thus, r is not a leaf, even if deg'p(r) = 1. Throughout, we use L to 
denote the set of leaves; thus, L = {v G V : u is a leaf of T}. 

For any leaf v of T, up{v) denotes a node q in P^^r that is nearest to the root and adjacent to v 
via a link. Since E is shadow-closed, it can be seen that up{v) is an ancestor of v, for each leaf v. 

For any node v, we use to denote the rooted subtree of (T, r) induced by v and its descendants. 
We use L(Ty) to denote the set of leaves of the subtree T^. Throughout, the terms tree or subtree 
refer to a rooted subtree of (T, r) that is “closed w.r.t. descendants” (if a node w is in the subtree, 
then all descendants of w are in the subtree). 

Property 2.1. Suppose thatT is a rooted tree. LetT^.^ andTy.^ be two (rooted) subtrees ofT. Then 
and Tj }2 either share no node or one is contained in the other. 

Remark 2.2. Throughout, symbols such as T, J, etc. denote an arbitrary item/set (rather than 
the eomplement of another set). 

Stems and twin links. We call a node s of T a stem if s ^ r, s has exactly two children, s has 
exactly two descendants that are leaves, and there exists a link in E between the two leaves of Tg] 
we call the link between the two leaves of Tg a twin link, and denote it by twinlk{s). (Our definition 
of stem differs slightly from [5, Definition 3.1].) Let denote the set of twin links. Observe that 

there is a one-to-one correspondence between twin links and stems. Throughout, we use S to denote 
the set of stems; thus, S = {v G V : u is a stem of T}. Moreover, we use TZ to denote the set of 
nodes that are neither stems nor leaves; thus TZ = V—{S U L). (The notion of stems and twin links 
is due to [6].) 

For any stem node s, we define (5)^^(s) = {us G E : v ^ V{Tg)}. Similarly, we define (5^”(s) = 
{vsGE:vG V{Tg)}. 
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Contraction and componnd nodes. We use the standard notion of contracting a link or a set 
of links, see [5], [2, Chapter 1], We use T' ;= T/F to denote the tree obtained by contracting 
each of the 2-edge connected components of T -|- F = (V, Et D F) to a single node. Each of the 
contracted nodes of T' = T/F is called a compound node, see [5, Section 3.2]; thus, each compound 
node corresponds to a set of two or more nodes of V{T). Each of the other nodes of T/F is called 
an original node. 

We also call a node of T an original node. We call a link in E an original link. When we say that 
£ is a link w.r.t. T we mean that (. is an original link; when we say that T is a link w.r.t. T' = T/F, 
we mean that there exists an original link i whose image in T/F is T. 

Buds and buddy links. Besides stems and their associated subtrees, one other type of subtree 
(and some of the incident links) plays an important role in this paper. 

We call a leaf bo a bud (see Figure 2) if there exists a (rooted) subtree T^ with exactly three 
leaves bo, bi, 62 such that (i) up{bo) is a descendant of v (possibly, up{bo) = v), (ii) Typj-f,^) contains a 
stem s such that bo, 61 are the leaves of Tg, and (hi) the link 6162 exists. We call 6162 the buddy link 
of bo and denote it by buddylk{bo). Observe that there exists an ancestor g of s in T^ such that 
q is the least common ancestor of s and 62 ; possibly, q = v, and possibly, q = v = r; moreover, 
^i'^up{bo)) '^(^up(6o)) {^0)^1) ^2}’ 

Note that 61 may be a bud as well. In that case, each leaf of is in {60, ^1, &2}) there exists 

a link between bo, 62, and buddylk{bi) = 60^2- 



Figure 2. Illustration of a bud bo and the subtree T„. The figure on the left shows 
the case when up{bo) is an ancestor of all three leaves in T„. The figure on the right 
shows the other case. 

Consider a bud bo, and let s, q be as above; we define 

:= E(n„,,p(,„))-E(F,,,)-{6o}. 

Thus F®P®"^^( 6 o) contains the internal nodes in the tree-path between bo and s; moreover, if up{bo) 
is an ancestor of all three leaves bo,bi,b 2 , then F®p®“^^( 6 o) also contains all nodes on the tree-path 
between the parent of q and up{bo); there are no other nodes in F®p®“^^( 6 o) (see Figure 3). 

Fact 2.3. F®P®“^^(6o) is the set of nodes w such that there exists a link bow in E, and w is not on 
the tree-path between the two ends of buddylk{bo). Every node in F®p®“^^(6o) has a unique child, 
and is an ancestor of bo. For two buds b,b that are not the leaves of the same subtree Tg, where s 
is a stem, the sets F®p®“^^(6) and F®p®"^^(6) are disjoint. (Thus, F®p®“^^(6) nF®P®"^^(6) = 0, unless 
the least common ancestor (in T) of b,b is a stem s.) 

We denote the set of buds and the set of buddy links by T^”'^ and respectively. Observe 

that there is a unique buddy link for each bud; thus, there is a bijection between and . 
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Figure 3. Illustration of for a bud 6o- The nodes in are 

indicated by solid circles. The figure on the left shows the case when T^p^bo) 
three leaves. The figure on the right shows the other case. 

For any node w of the tree T, we denote by the set of buds in the subtree T^. For a 

stem s, note that may contain zero, one, or two nodes. 

Let and Thus, we partition the set 7^ (of nodes 

that are neither stems nor leafs) into two subsets, the “special” subset 7 ^®P®“aJ “normal” 

subset TZ’^onspci^ 

We use to denote the set F—U namely, the set of links that are neither twin 

links nor buddy links. 

Vectors and convex combinations. For any vector x G M^, let ones{x) denote the set of links 
of x-value one, thus ones(x) = {uv G E : Xuv = !}• 

For a vector x G M'® and any subset J of E, x{J) denotes Given several vectors 

v^,v‘^, ..., we write one of their convex combinations as Xliez thus, Z is a set of indices, and 
we have A* > 0, Vi G Z, and ~ 

3. Initial LP relaxation 

This section presents our initial LP relaxation of shadow-closed instances of TAP, denoted by 
(LPo); this is the same LP as in Part I (Section 3). 

Let uiui and U 2 V 2 be a pair of links. We call it an overlapping pair of links if (i) Pui,vi, Pu 2 ,v 2 
have one or more tree-edges in common, and (ii) either an end of uivi is in Pu 2 ,v 2 ^ or an end of U 2 V 2 
is in Pui ,di • We call a set of links J an overlapping clique if every pair of links in J is an overlapping 
pair. We use the notion of overlapping pairs for stating (LPq). 

(LPq) minimize : E ^UV 

uvGE 

subject to : Xuv >1, Ve G Et 

uveS^ie) 

Xuivi + Xu 2 V 2 <1, V overlapping pairs uiVi,U 2 V 2 G E 

0 < X < 1 

4. Lasserre tightening and its properties 

This section presents the Lasserre tightening of the initial LP and proves some basic properties 
and inequalities; most of these results are from Part I, but there are two new results. The proofs of 
the results from Part I are omitted. 
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Let LASi(LPo) denote the level t tightening of (LPq) by the Lasserre system. ^ Recall that ones(x) 
denotes the set of links of x-value one, where x G is a feasible solution of (LPq). Rothvofi, see 
[11, Theorem 2], formulated the following decomposition theorem for the Lasserre system, based 
on an earlier decomposition theorem due to Karlin-Mathieu-Nguyen [8]. (We use this particular 
formulation and not the original statement of [8]; hence, we reference both [8] and [ 11 ]-) 

Theorem 4.1. Let J E. Let k he a positive integer such that |ones(x) (1 J\ < k for every feasible 
solution X of (LPq). Then for every feasible solution y G LASt(LPo), where t > k + 1, y can be 
written as a convex combination y = Xliez such that x* is in LASt_fc(LPo) and x*|j is integral 
(i.e., x\^ is integral for each uv G J), for all i G Z. 

Lemma 4.2. Let J C E be an overlapping clique. For every feasible solution x of (LPq), we have 
|ones(x) n J| < 1. Furthermore, for every level t > 2 and every feasible solution y o/LASt(LPo), we 
have y{J) < 1. 

Lemma 4.3. Let w be a leaf of T, and let u he an ancestor of w such that every internal node 
(if any) of Pw,u has exactly one child. Let e = vu be the tree-edge of Pw,u that is incident to u 
(possibly, v = w). Then, 5 e{o) is an overlapping clique. In particular, 5_B(rc) is an overlapping 
clique. Moreover, we have y{6E{w)) = 1 for any feasible solution y G LASt(LPo) where t >2. 

Recall that the matching polytope of the subgraph induced by the leaves, G{L) = {L,E{L)) is 
given by the following constraints: 

x{6e{l){v)) <1, \/v G L 

x{E{W)) < VWCL, jlTjodd 

X > 0 

The next result is essentially the result on the matching polytope from the survey of Rothvofi, 
see [ 12 , Lemma 13, Sec 3.3], translated to our setting. 

Lemma 4.4. Let e > 0, and let y G LaSj^ ,. (LPq). Then, ig matching polytope of 

G{L) = {L,E{L)). 

Lemma 4.5. Let x be a feasible solution of (LPq) and let s be a stem. Then |ones(x) n (5£;('S)| < 3. 

Proof. Since s is a stem, s is incident to three tree-edges. Let ei, 62 , 63 be the tree-edges incident with 
s. Clearly, J* = SEih) n 5e{s) is an overlapping clique, for each 1 < i < 3, and 5e{s) = Ui<i <3 Jj. 
Then the result follows from Lemma 4.2. □ 

Lemma 4.6. Let t > 3, and let y be a feasible solution of LASt(LPo). Suppose that s is a stem. 
Then: 

(1) y{twinlk{s)) < y{S'ff'{s)). 

(2) For each bud b in Tg (if it exists), we have y{buddylk{b)) < y{bs) -|-y(P( 6 , 

Proof. Let the leaves of Tg be bo and 61 . Let J = 5E{bo) U 5_b(6i). Since bo,bi are leaves, by 
Lemma 4.3, (5_b(6o) and 5E{bi) are overlapping cliques. Thus, by Lemma 4.2, |ones(x) n J| < 2 
for any feasible solution x of (LPq). By Theorem 4.1, y can be written as a convex combination 
AjX* such that x* G LaSi(LPo) and x*|j is integral, Vi G Z. 


^Formally speaking, LASt(LPo) is a set of vectors in . In what follows, we abuse the notation and take 

LASt(LPo) to be the projection on the subspace indexed by the singleton sets; thus, we take LASdPPo) to be a set of 
vectors in R^’^'L 
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(1) : Consider statement (1) of the lemma. Observe that twinlk{s) = b^bi. Since b^bi G J, either 

x^{bQbi) = 0 or a;*( 6 o&i) = 1, for all i £ Z. Let = {i : x*( 6 o 6 i) = 1}. Then 
y{bobi) = kiX^ibobi). Consider x*, where i G ^(i)- Let be the tree-edge between 

s and its parent. Then, x*(5£;(es)) > 1. Notice that every link in 6E{es) with positive 
x*-value must have s as its end in T^; otherwise, if such a link has an end at some other 
node of Tg, then it will be overlapping with the link 6 o 6 i, and we would have a violation 
of the overlapping constraints of (LPq). Thus, x^(dE(es)) = x^(dE(es) n Se(s)). Hence, 
x*((5^^(s)) = x’'(6E(es) n dB(s)) = x^(SE(es)) > 1 = x^(bobi),Vi G Z(iy Consequently, 
y(bobi) = EieZ(i) ^ixHbobi) < EieZ(i) < I2iez (•s))- 

( 2 ) : Consider statement (2) of the lemma. W.l.o.g., let b = bo (the same argument applies for 

b = bi). Let buddylk{bo) = 6162 - Since 6162 G J, either x*( 6 i 62 ) = 0 or x*( 6 i 62 ) = I, for all 
i G Z. Let Z( 2 ) = {i : x*( 6 i 62 ) = !}• Then 2 /( 6162 ) = I]ieZ(2) ^ix''{bib 2 )- 

Consider x*, where i G Zy^y Let (. = b^w be a link such that x*(£) = 1; such a link must 
exist, li w ^ s and w is in Pbi,b2i then £ is overlapping with 6162 , and we would have a 
violation of the overlapping constraints of {LPq). Hence, by Fact 2.3, w G 7^®p®“^^(6o) U {s}. 

Then, x\buddylk{bo)) = 1 < x^{£) < x*( 6 os) + x*(F;(6o,7^®P®“^^(6o))). 

Consequently, y{buddylk{bo)) = J2ieZ(2) ^ix'(bib 2 ) < EieZ( 2 ) Aj(x*( 6 os)-fx*(L;( 6 o,7^®P®‘'“^(6o)))) 
+x'(i?(6o,7^«P'’“"'(6o)))) = y(6os) + y(F;(6o,7^^P^“"^(6o))). 

□ 

For any stem s and for any x G M^, we define slackx(s) to be 

+ X] {x{E{b,n^P^^'^\b))) - x{buddylk{b)))'j + ^ (x((5^'’(s)) - x(twmJk(s))). 

Lemma 4.6 implies the following fact. 

Fact 4.7. For any stem s and for any feasible solution y to LASt(LPo) where t > 3, we have 
slacky{s) > 0 . 

Remark 4.8. Although the above fact assumes that the level t is > 3, observe that slackx{s) is 
well defined for all levels > 0 . 


5. Potential function 

This section presents the potential function used by our algorithm. The potential function is based 
on a feasible solution y to the Lasserre tightening of {LPq). It is essential to tighten {LPq) because 
the potential function is not valid for feasible solutions of {LPq). Possibly, our potential function 
may be obtained via weaker lift-and-project systems (e.g., Lovasz-Schrijver, or Sherali-Adams). We 
prefer to apply the Lasserre tightening because our analysis in Sections 7-8 makes essential use of 
the decomposition theorem (Theorem 4.1), and this theorem is not known to hold for any weaker 
lift-and-project system; see Part I (Section 10). 

Our potential function is defined via a subset of the leaves that is denoted by A. This subset is 
determined by the instance of TAP. Informally speaking, it consists of all the leaves of all occurrences 
of a particular type of subtree, called a bad 2-stem tree, see Section 6. Our potential function consists 
of two parts, a “preprocessing” part and a “normal” part. Our algorithm applies two preprocessing 
steps; the first preprocessing step contracts all occurrences of maximal bad 2-stem trees, and for 
this we have to “charge” the “preprocessing” part of our potential function. 

We say that a set of leaves A is compatible if the following holds: 

• For every twin link and for every buddy link, either both ends of the link are in A or no end 
of the link is in A; in other words, no twin link and no buddy link is present in E{A, L — A). 



In what follows, let A C L denote a compatible set. 

We denote the set of stems with both leaves in A by Sa- Similarly, let denote the set of 
buds in A, and let denote the set of buds in L—A. By the definition of a compatible set, the 

following fact holds. 

Fact 5.1. 

T bud _ I I T bud( \ 

^(L-A) - ^s&S-Sa^ 

E^'^^dylk^L-A) = U,^s-sAbuddyim : b G A'^%s)}. 

Let M^A—a) denote a maximum matching of the subgraph (L—A, A)), and let de¬ 

note the set of nodes of this subgraph exposed by the matching M^A—a)’ ^^us U^^i^a) — {L—A)—{v G 
V : u is an end of some link i G M^A—a)}- 

We mention that our potential function (the right-hand side of the inequality in Lemma 5.2 below) 
refers to the terms A, M^^—a)^ ^(l—A)- Thus, when we use our potential function, we have to ensure 
that these terms have been defined already; we will “fix” our potential function by appropriately 
defining A, M'^^A-Ay ^(L —A) ill Section 6. 

Given A and y G M^, we use lbdy{A) to denote the quantity 

IviEiA)) + ly(F(A, L-A)) + y{E{A, V-L)) + ^ y{5E{s)); 

sdSy 

this is one of the terms in our potential function; informally speaking, this is the main component 
of the “preprocessing” part of the potential function. 


Lemma 5.2. Let e > 0 be a constant, and let t > max{T -j- 1, 3}. Let y G LASt(LPo)- Then, 

(2 + ^)y(-®) - \ + |f^(L-A)l + iddy(A) 

+ ty{E{V-L)) + t Yl, !'(&(iO) + i E v{E{v,V-yy_,^y)))+ E sl»ck,(i). 

y^HnonspcI y^^'J^specml 

Moreover, every item on the right-hand side of the inequality is nonnegative. 

Proof. Observe that y is feasible for LASt(LPo) for t > 3, hence, y >0 and (by Fact 4.7) slacky{s) > 
0,Vs G S. Hence, every item on the right-hand side of the inequality is nonnegative. 

For each link uw, we distribute the value ‘^yuw as follows; 

- if both u,w ^ L, then uw keeps the value ^yuw', 

- if both u,w ^ V—L, then uw keeps the value ^yuw and each of the ends u and w gets value 

^yuw! 

- if only one of u or w is in V—L, then uw keeps the value i/uw and the end in V—L gets value 

(In other words, each node in V—L borrows value ^yuw from each link uw incident to it, and the 
link keeps the remaining value; links uw that are not incident to V—L keep all of the value ^yuw) 
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Thus, we have 

^^y{E) = ^^y{E{K)) + ^^y{E{L-K)) + ^^y{E{KL-K)) 

+ ]^y{E{V-L))+y{E{V-L,K))+y{E{V-L,L-K)) + ]^ Y, y{5E{v)). 

vev-L 

Then we increase the coefficients of the twin links and buddy links in E{L — A) from | to 2 by 
borrowing the value ^y{E^'^^’^{L—A)) + ^y{E^'^'^'^y{L—K)) from the last term above, and adding it to 
the term ^y{E{L—A)). Thus, we replace the term ^y{E{L—A)) by 2y{E{L—A)) — \y{E^^^{L—A)), 
and we replace the last term by 

5 E i'ffef”)) - |!/(£‘“”(r-A)) - 

vev-L 

= ^ X] ^ Y1 y(.^E{v)) + ^ Y yi^E{v)) + ^ Y (yi^E^is))+ y{s'Eis))) 

S&Sa ,,g7^nonspcJ y^'J^special 

- \v(E‘~‘“{L-\)) - \y(E’^'‘'‘'‘>-(L-K)) 

= 5 E »(■'''=(«))+i E !'('*£(>’)) +5 E yiiE(v)) + \ E !/(«”(»))-5!/(-B‘””(r-A)) 

p^'Jlnonspcl y^'J^special 

+ 5 E !/(«"(»))-5</(iJ'’""’'(i-A)) 

s^S S\ 

=T E 5 E !/(fe(!>)) + (5 E !/(£(«. r-tfi“E|{0))+ E !/(-e(6,k‘'‘““'(4)))) 

SGSa y^-J^nonspcI „g7^speciaj f,g£^bud 

(L-A) 

+ 5 E - 59(£"”(i-A)) + t E •/(«!"(»))-5!/(E'’“‘'‘''*'(i-A)) 

sG(S Sj\ sQS iSa 

=T E 5 E !/(fe(!>)) + 5 E y{E(v,V-L^“i^,(v))) 

s€Sa y^-J^nonspcl y^-J^specml 

+ 2 E (y(^Eis))+ Y (yim^^^^'^^’m-yibuddyikm') 

sGS — Sa bGE'^‘^(s) 

+ ^ E {yi^'i(.s)) - y{twinlk{s))^ 

s^S S\ 

= ^ E ^ E y(.^E{v)) + ^ Y y{E{v,V-L’^l±Y'>^y)) + E siacicy(s), 

SGSa PgT^nonspcJ ^gT^speciaJ 

where (2) follows from Fact 5.1 due to the assumption that A is compatible, and (1) follows from 
the following equation: 

U E{w,Lfl±Y^y^ = U 

^gT^specia] 

(L-A) 

This equation is based on some observations. First, the set on the right-hand side is clearly a subset 
of the set on the left-hand side. Conversely, consider a link wa in the set on the left-hand side, 
where w G y^specmi ^ ^ Since w is an ancestor of a and w is not on the tree-path of 
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the link buddylk(a), by Fact 2.3, we have w is in i.e., wa G E(a,7Z^P^^‘^\a)). Hence, the 

link wa belongs to the set on the right-hand side as well. 

Thus, the expression for ^y(E) can be written as 

= ly(E(A)) + |!,(£(A, L-A)) + 2!,(£(L-A)) - 

+ ^y(E(V-L}} + y(E(V-L, A)}+y(E(V-L. L-A)) 


1 

+ 2 


X] ^ Y1 y(.^E{v)) + ^ y{E{v, V-L^^l±j^^{v)))+ Y1 slacky{s) 

S&Sa y^'Jlnonspcl .ygT^specia] 

= (5!/(£(A)) + i!,(E(A, L-A)) + !,(£(A, V-L)) + i !/(fe(s))) 

sGcSa 

+ (!/{E{L-A, A))+2,/{E{L-A)) + y(E(L-A, C-L))) 

+ \y(E(V-L)) + \ Y. !'(fe(”)) + ^ E y{E{v,V-L\f_^ym+ E "»(«) 

V^Tlnonspcl „g7^specia] g^g — g^ 

- ^-y(E'‘HL-A)) 

= ibd,(A)+ E ymv)} + ^y(E(V-L)) 

veL—A 

+ b y(^E(v)) + ^ y(E(v, slacky(s) 


y^'J^nonspc] 


- -y{E^^^{L-K)) 


y^'J^speciai 


sG.S S\ 


By Lemma 4.3, we have y{5E{v)) = 1 for any v G L. Hence, we replace the term J2v&l—a yi^Eiv)) 
(the 2nd term in the displayed equation above) by |L—A|. Moreover, by Lemma 4.4, we have 

is in the matching polytope of {L,E{L)). Thus, is in the matching polytope of 

(L—A, A)), which implies y{E^^^{L—K)) < (l-|-e)|M^^®^^^|. We derive the inequality (stated 

in the lemma) by replacing the term — A)) (the last term in the displayed equation above) 

by -|(1 -k e)|Mjl^^|. Now, observe that |L-A| - ^(1 + = |H(l_a)| + - 

2 (e)|M^^®^^^|, because [/(a—a) = {L—K)—{v G V : u is an end of some link G Note 

that < i|L| = lT.veLyi^E{v)) < y{E), hence, > -ey{E). Thus, we get our 

potential function; 


{^ + e)y{E) 


> 


^|Mff_A)l + |%_A)| +ibd,(A) + ly{E{V-L)) 


+ 2 


\ Y^ yiSEiv)) + ^ Y^ y{E{v,V-L^l±^-^iv)))+ Y^ slacky{s). 


y ^J^nonspcl 


y^J^specia} 


sGtS cSa 


□ 


6. Algorithm and credits L Preprocessing steps 
We state our main result for (unweighted) TAP: 
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Theorem 6.1. Let e > Q he any (small) constant, and let t > max{17, ^ + 1}. Let y denote an 
optimal solution o/LASt(LPo). The integrality ratio o/LASt(LPo) is < § + e. Moreover, there is a 
polynomial-time algorithm for TAP that finds a feasible solution (a set of links that covers the tree 
T) of size < (| + e)y{E). 

For the rest of this paper, we let y G denote an optimal solution of LASt(LPo), where t > 
max{17, 1 + where e > 0 is any constant. Our goal is to show that our algorithm finds a set of 
links that covers T of size < (| + e)y{E). We achieve this goal using our potential function (this is 
the right-hand side of the inequality in Lemma 5.2); we will “fix” the potential function below by 
defining the relevant terms (namely, A, Recall from Part I (Section 6) that the 

potential function provides credit to the algorithm. 

Also, recall from Part I (Section 6) that the combinatorial algorithm is a greedy-type iterative 
algorithm that makes a leaves-to-root scan over the tree T and (incrementally) constructs a set of 
links E that covers T. The algorithm starts with E := 0, at each step it adds one or more links to 
E (it never removes links from F), and at termination, it outputs a cover P of T whose size is < 
the potential function. 

The algorithm incurs a cost of one unit for every link that it picks, and it incurs a cost of one unit 
for each new compound node that it creates in the execution. The key to the analysis is to show 
that for each step, the cost incurred is compensated by a part of the credit. 

We mention that (with one exception, see Lemma 8.3) the nodes or links that get contracted into 
a compound node are no longer relevant for the algorithm or the analysis. In particular, the credit 
(if any) of such nodes or links may be used at the step when they get contracted into a compound 
node, but after that step, any remaining credit of such nodes or links is not used at all. 

For the current tree T' = T/F and nodes u,w of T', let denote the path of T' between u 
and w. By a fitting cover of a rooted tree T( C T' we mean a set of links J that covers all of the 
tree-edges of T( but does not cover any other tree-edge; thus, we have Auw&jP'uw — 'P'v 

6.1. Semiclosed trees. We recall the notion of a semiclosed tree w.r.t. an arbitrary matching. This 
notion is due to Even, et ah, based on earlier work by Nagamochi [6]; also, see [5, Definition 2.3]. 

Let T( be a rooted subtree of the current tree T' = T/F. Let M be an arbitrary matching of the 
leaf-to-leaf links. T( is called semielosed w.r.t. M if the following conditions hold: 

(i) Each link in M either has both ends in T( or has no end in T(. 

(ii) Every link incident to an M-exposed leaf of T( has both ends in T(. 

Let M{T() denote the set of links in M that have both ends in T(. 

We define 

T{M,Tfi) := M{Tfi)\^{up{w)w : rc is an M-exposed leaf in T)]}; 

thus, we associate a “basic link set” with the pair M,T(. In general, the “basic link set” may not 
be a cover of T(. 

By a minimally semiclosed tree T( we mean that T( is semiclosed but none of the proper rooted 
subtrees of T( is semiclosed. 

Lemma 6.2 (Even, et ah, [5]). Let T( he a minimally semiclosed tree w.r.t M. Then T{M,Tfi) is 
a fitting cover ofT). 

6.2. Maximum matching. Our algorithm and analysis are based on a maximum matching of the 
leaf-to-leaf links that are neither twin links nor buddy links. Let M denote one such matching; thus, 
M is a maximum matching of the subgraph (L, P^®^(L)). By an M-link we mean a link that is in 
M. We denote the set of M-exposed leaf nodes by U. 
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The image of M w.r.t. T' is {u'w' : uw G M,u' ^ w'}, where u',w' denote the images (w.r.t. 
T') of the original nodes u,w oiT (i.e., we have u' = u if u is an original node of T', otherwise, u' 
denotes the compound node of T' that contains u] w' is defined similarly). We abuse the notation 
and use M to denote both M and its image w.r.t. T', and by an M-link of T' we mean the image 
(w.r.t. T') of an original M-link. 

For the rest of the paper, unless mentioned otherwise, a semiclosed tree means a subtree that is 
semiclosed w.r.t. the matching M. 

6.3. Bad 2-stem trees. Let T„ be a semiclosed tree rooted at v that has exactly two stems si, S 2 
and four leaves, where we denote the leaves of the tree by u*, re* for i = 1, 2. 

By a leafy 3-cover of T„ we mean a set of three links J such that J covers T^, one of the links in 
J has one end in Ty and one end in L—L{Ty), and the other two links in J have both ends in Ty. 

We call Ty a bad 2-stem tree if (i) one of the links in E{{ui,wi}, {u 2 ,W 2 }) is in M, (ii) two of 
the leaves are M-exposed, (hi) one of the leaves is incident to all the links in E{{ui,wi}, {u 2 ,W 2 }) 
(thus E{{ui,wi}, {u 2 -,W 2 }) has one or two links), (iv) there exists a cover of Ty of size three, and 
(v) there exists no leafy 3-cover of Ty. Let us fix the notation such that wiW 2 is the unique M-link 
in E{L{Ty)); thus, ui and U 2 are M-exposed (see Figure 4). 

By a maximal bad 2-stem tree Ty we mean a bad 2-stem tree that is not a proper subtree of 
another bad 2-stem tree. (Thus, any tree Tq rooted at a proper ancestor q of v (if q exists) must 
violate one of the conditions for being a bad 2-stem tree.) 



Figure 4. Illustration of a bad 2-stem tree. The dashed lines indicate links and the 
thick dashed line indicates an M-link. 

Let = {T^j,... ,Tyi^} denote the set of maximal bad 2-stem trees of T; clearly, any two 

trees in are disjoint, by Property 2.1. We use V{EP^^p) to denote {jrp^^jrprep V(Ty). We define 

that is, A consists of all the leaves of all the trees in EP‘’^p. 

Since each bad 2-stem tree has a cover of size 3, the shadow-closed property implies that each 
bad 2-stem tree has a fitting cover of size < 3. Our algorithm applies a preprocessing step that 
contracts each tree Ty G jrprep |^y ^ fitting cover of size < 3. 

Preprocessing step 1 (A-contraction): For every tree Ty G add a fitting cover of Ty of 

size < 3 to F and contract Ty to a compound node. 

The cost incurred for this step is < since each tree in incurs a cost of < 3 for its 

fitting cover and a cost of 1 for the resulting compound node. 

This cost is charged to one part of our potential function, namely, it is charged to 

lhdy{K) + ^ y{dE{v)). 

y^'Jl’^onspclp;v ( J'P''CP) 

Lemma 6.6 below shows that this quantity is > 4:\EP''^p\. 
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Remark 6.3. The results in this section show that K-contraction is valid, in the sense that the 
algorithm has sufficient credits to pay for the cost of this step. Moreover, A.-contraction is essen¬ 
tial for the overall analysis (i.e., the algorithm cannot skip A-contraction), because the proof of 
Theorem 8.7 (Subcase 2.2) relies on A-contraction. 

Lemma 6.4. Let and A be as defined above. Then, we have 

lbdy{A) + ^ Yj yi^Eiv)) 

^gT^noBspciny ( J-prep) 

> E {ly{miT.))) + lyiE{L{TffiL-L{n)))+y{E{L{n),V-L)) + ^ Y 

T„eEP‘-'=P u&V(T„)-L 

Proof. Recall that ibdj;(A) denotes ^y{E{A))-ir^y{E{A, L—A))-iry{E{A,V—L)) -\-^'^g^^^y{6E{s)). 

Observe that a bad 2-stem tree has no buds, so every node of such a tree is either a leaf, or a 
stem, or a node of Hence, 


(^j^nonspcl p 't/(jrprep)) y 5 a = U inm-L) 

T„&FP'^‘=P 


and so we have 


E yi^Eiv)) + 

v£TZnonspclrV {EP‘'<^P) 


^ E y(^Eis)) 


E ^ E yi^E{u)). 

T^&EP’-^P u&V{T„)-L 


We need the following claim to prove the lemma. 


Claim 6.5. Let Ai, A 2 ,..., A^ be a partition of A. Then, we have 


^y{E{A))+^y{E{A, L-A)) + y{E{A, V-L)) 

„ fc .. k k 

>-Y yiEiA.)) + 2 E + E 

i=l i=l 2=1 

We partition A into the sets A* = L{T.m), where T^j,... are the maximal bad 2-stem trees in 
jrprep^ Then, we have 

lbdy{A) + ^ Y yi^Eiv)) 

^ g T^nocspcJ n V (J-prep ) 

= iy{E(A)) + f„{E{A,L-A))+y{E{A.V-L))+ Y, 5 E 

> E {ly{E{L{T,))) + \y{E{L{TffiL-L{n)))+y{E{L{n),V-L)) + ^ Y 

T„eEP’-'=P u&V{T„)-L 
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To prove the claim, observe that 
^viEiA)) + IviEiA, L-A)) + y{E{A, V-L)) 

„ k 1 ^ 1 ^ ^ 

> (2 E+ 2 EE viEiK^j))) + 2 E+ E^-^)) 

2 = 1 2=1 jf 7^:2 2=1 2=1 

= 5 y{E{A,)) + ^ y{E{K, L-A,)) + ^ y{E{A„ V-L)). 

2 = 1 2=1 2=1 

□ 

Lemma 6.6. Let T„ be a bad 2-stem tree. Then we have 

ly{E{L{T„))) + \y{E{L{T„), L-L{n))) + y{E{L{T„), V-L)) + J y{5E{u)) > 4. 

ueV(n)-L 

Hence, we have 

lhdy{A) + \ Y. y{5E{v))>4\FP^^P\. 

^gT^nonspdny ( J-prcp) 

Proof. The second statement follows immediately from the first statement and Lemma 6.4. We 
focus on the first statement. 

Let si,S 2 denote the two stems in r„, and let ui,wi (respectively, U 2 ,W 2 ) denote the two leaves 
in Ts, (respectively, Ts,). Thus, L{Ty) = {ui,wi,U 2 ,W 2 }. W.l.o.g. let W 1 W 2 denote the unique 
M-link in E(L(Ty)); the two twin links uiwi, U 2 W 2 are also in E{L[T„)), and E{L(T„)) may contain 
one other link incident to wi or W 2 ‘, there is no link between m and U 2 (see Figure 4). Note that 
every link incident to ui or U 2 (the M-exposed leaves of T^) must have both ends in T„, since is 
semiclosed w.r.t. M. 

Let J denote the set of links incident to the leaves of r„, thus, J = Uj=i 2 {^E{ui) U (5_B(rci)^. For 

any feasible solution x of (LPq), Lemmas 4.3 implies that |ones(x)n J| < 4. Hence, by Theorem 4.1, 
y can be written as a convex combination such that x* G LaS 3 (LPo) and x*| j is integral, 

Vi G Z. (In what follows, we have to apply Lemma 4.6 to x*, hence, x* must be feasible for level 3 
(or higher) of the Lasserre tightening.) 

Thus, it suffices to show that for any i £ Z, we have 

E(p(L(T,))) +E(p(L(r,),L-L(r,))) +p(p(L(r,),H-L)) + 2 J] x\Se{u))>4- 

ueViTV-L 

let a denote the left-hand side of the above inequality. 

Observe that every link in J with positive x*-value must have x*-value one. Suppose that one of 
the links i G E{L{T„)) has positive x*-value, thus x*(£) = 1; then, we have |x*(P(L(r^))) > |, thus I 
contributes value | to a. Also, if one of the links £ with one end in L{T„) and the other end at a non¬ 
leaf node of r„ has positive x*-value, then we have x*(P(L(Tt,), H—L))^ X]ite(y(r„)—L) x^{5e{u)) > 
1 -I- ^ = |, thus, £ contributes value | to a. 

We complete the proof by case analysis, by considering the number of links with positive x*-value 
such that one end is in L{Ty) and the other end is not in Ty. 

Case 1.: Suppose that there are no links with positive x*-value such that one end is in L{Ty) and 
the other end is not in Ty. Then, we focus on the number of links in E{L{Ty)) that have 
positive x*-value; this number is zero, one, or two, because every link incident to a leaf 
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has a:*-value zero or one, and moreover, x^{6e{u)) = 1 for each leaf u of r„, by (LPq) and 
Lemma 4.3. Thus, we have three subcases. 

Subcase 1.1.: x^{E{L{Ty))) = 0. Every link of x*-value one incident to a leaf of Ty has its 
other end at a non-leaf node of Ty, hence, each such link contributes | to a; we have 
four such links, so a > 6. 

Subcase 1.2.: x^{E{L{Ty))) = 1. The link i in E{L{Ty)) with a:*-value one contributes | 
to a. The two leaves in L{Ty) that are not incident to £ each contribute | to a, because 
each is incident to a link of x*-value one that has its other end at a non-leaf node of Ty, 
hence, a > |. 

Subcase 1.3.: x^{E{L{Ty))) = 2. Since x^{5e{u)) = 1 for each leaf u of Ty, the links in 
E{L{Ty)) with x*-value one share no ends. By the definition of bad 2-stem trees, all the 
links in E{{ui,wi}, {u2,W2}) are incident to one of the four leaves. It follows that the 
twin links uiwi and U 2 W 2 both have a;*-value one, and each contributes | to a. Next, 
focus on the stems in Ty] by Lemma 4.6, x^{5'^{sj)) > x^{ujWj) = 1 for j = 1, 2. Thus, 
each stem contributes at least ^ to a via the term ^J2u&{v{Ty)—L)^^i^E{u)). Hence, 
a>2(|) + 2(i) = 4. 

Case 2.: Suppose that there is at least one link with positive x*-value such that one end is in L(Ty) 
and the other end is not in Ty. Again, we focus on the number of links in E{L(Ty)) that 
have positive x*-value; it can be seen that this number is zero or one. (Note that every link 
incident to a leaf has x*-value zero or one, and moreover, for every leaf u of Ty, we have 
x^{5e{u)) = 1.) Thus, we have two subcases. 

Subcase 2.1.: x^{E{L{Ty))) = 0. Every link of a:*-value one incident to ui or U 2 (the M- 
exposed leaves of Ty) has its other end at a non-leaf node of Ty, hence, each such link 
contributes | to a; we have two such links. Every link of x*-value one incident to wi 
or W 2 (the M-covered leaves of Ty) has its other end either in V—L, and so contributes 

1 to x^{E{L{Ty), V—L)), or else has its other end in L—L{Ty), and so contributes ^ to 
^x'^{E(L{Ty), L—L{Ty))); again, we have two such links. Hence, a > 2(|) -|- 2(^) = 4. 

Subcase 2.2.: x^{E{L{Ty))) = 1. Let £y denote a link of positive a;*-value such that one 
end is in L{Ty) and the other end, denoted by q, is not in Ty. Note that x^(£y) = 1. 
The end of £y in L{Ty) must be M-covered; w.l.o.g. assume that this node is wi. 

Note that x^{5e{u)) = 1 for each leaf u o£ Ty. Since Ty is a bad 2-stem tree, the 
links in E{L[Ty)) that are not incident to wi are the twin link U 2 VJ 2 and possibly the 
link U 1 W 2 (there is no link between ui,U 2 ). Thus, either x^{u 2 W 2 ) = 1 or x^{uiW 2 ) = 1, 
and this contributes | to a via the term ^x^{E{L{Ty))). In either case, one of the 
M-exposed leaves ui or U 2 is incident to a link of x*-value one that has its other end at 
a non-leaf node of Ty, and this contributes | to a via the term x^{E{L{Ty), V—L)) + 

2 Yju&(V{Ty)-L) x''{5e{u)). 

We have two subcases, depending on whether q (the end of £y in V—V(Ty)) is a leaf 
or not. 

Subcase 2.2.1.: q ^ L. Then, the link £y = wiq contributes 1 to a via the term 
x^{E{L{Ty), V—L)). Then, we have a>|-|-§ + l = 4. 

Subcase 2.2.2.: q ^ L. Then, the link £y = wiq contributes ^ to a via the term 
lP{EiL{Ty), L-L{Ty))). 

Moreover, observe that we cannot have x^{uiW 2 ) = 1, otherwise the three links 
£y = wiq,uiW 2 ,U 2 W 2 form a leafy 3-cover of Ty (see Eigure 5(a)); recall that a 
bad 2-stem tree cannot have a leafy 3-cover. 
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Figure 5. Illustration of leafy 3-covers of Ty for Subcase 2.2.2 in the proof of Lemma 
6.6. The dashed lines indicate links and the thick dashed lines indicate M-links. 


Thus, U 2 W 2 is the unique link in E{L(Ty)) with x*-value one. Now, focus on the 
stem S 2 ; by Lemma 4.6, > x^{u 2 W 2 ) = 1- Thus, 5g’(s2) contributes at 

least ^ to a via the term ^ J2u&{v{Ty)—L) 

Finally, note that U 1 S 2 is not present; otherwise, we would have a leafy 3- 
cover consisting of the (hypothetical) link U 1 S 2 and the links iy = wiq,U 2 W 2 (see 
Figure 5(b)). Hence, the link of x*-value one incident with ui is not in 5^^(s2)- 
Thus, we have a>| + | + 5 + 5=4. 

This completes the case analysis, and completes the proof. □ 

6.4. Credit assignment for the algorithm and the preprocessing. Recall that the algo¬ 
rithm starts with a number of credits equal to the potential function, namely, the right-hand side 
of the inequality in Lemma 5.2. In order to specify the potential function, we need to specify 
A)’ ^(L—A)- We take to be M n E{L—A), i.e., the restriction of M to the subgraph 

(L—A, E’^^^{L—A)). It can be seen that M n E{L—A) = M—E{A) is a maximum matching of the 
subgraph (L—A, E^^^{L—A)), as required by the definition of in Section 5. We take 

to be 1/ n (L—A); again, this agrees with the definition of in Section 5. 

The cost of A-contraction (Preprocessing step 1) is 

< lbdy{A) -k ^ Yj yi^E{u)), 

.^gT^nonspdny ( J-prcp) 

where is defined above. We subtract this quantity from our potential function, and then add 

the credit of the compound nodes formed by contracting the trees Ty G j:prep ^ remaining 

potential function (i.e., total credits available) for the rest of the execution. Clearly, the remaining 
potential function is 

^|M n E{L-A)\ + \un (L-A)l + + 

\y{E{y-L)) + ]^ Y y{^E{u))+'^ Y y(-^(^’’^-^(L-A)(“))) + slacky(s) 
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By the integral potential function we mean the sum of the first three terms above (namely, 
||M n E{L—A)\ + \U n (L—A)| + and by the fractional potential function we mean the sum 

of the remaining terms (namely, the sum of the four terms that use y). 

The following observation is useful for simplifying our notation. 

Fact 6.7. For the current tree T' (after k-contraction), U n (L—A) is the same as the set of Ad- 
exposed original leaves, and Ad n E{L—A) = Ad{T'). 

We start with the credit given by the integral potential function, and we maintain the following 
assignment of credits to the nodes of T' and the links of M{T') = M n E{L—A): 

• every M-exposed original leaf has one credit, 

• every compound node has one credit, 

• every link of Ad{T') has | credit, and 

• the root r has one credit. 

It can be seen that the integral potential function suffices for assigning credits to the tree T' that 
results from A-contraction. (See Part I (Section 6) for a discussion on the unit credit for the root 
r.) 

We define the integral credit of a set of links J (w.r.t. T') to be the sum of the credits of the 
M-links pq such that V{Pp^q) C U«u,eJ plus the sum of the (integral) credits of the nodes 

in Un«,eJ V^(T(^ .^), plus one if r occurs as an original node m 

In other words, the integral credit of J is the sum of | times the number of M-links pq such that 
y{Pu^yj), plus the number of compound nodes in IJnt«eJ yiPu,w)^ pins the number 
of M-exposed original leaves in the same set, plus one if r occurs as an original node in P'uw 

Informally speaking, when a step of the algorithm contracts all the links in J, then this amount of 
integral credit is available for this step (but not for subsequent steps). 

Now, consider the fractional potential function. We use it to maintain an assignment of frac¬ 
tional credits to the (rooted) subtrees of T'. For any subtree T( of T', observe that V{T() n 
(j^nonspci_v(^jrprep)) _ y(T() fl because none of the original nodes in is present 

in T' after the A-contraction; similarly, we have V(T() n (S—Sa) = y{T() n S, because none of the 
stems in Sa is present in T' after the A-contraction. For any subtree T( of T' and for any vector 
X G M'®, we use ^{x,T() to denote 


]^x{E{{V-L) n F(T'), {V-L) n {V(T')-V{T())))+ 

\ x{6e{u))+'^ Y x{E{u,V-L'(f_A){u)))+ Y s-^ack,,(s). 

^tey(T.^)n7^“™=p=^ s&v{T^)r\S 

The first term is defined on links that have both ends at original non-leaf nodes of T', and 
moreover, have exactly one end in T(. Similarly, the last three terms denote credits that are assigned 
to original nodes in T(. We mention that the credits given by these four terms cannot be used two 
or more times by the algorithm. This holds because all credit (integral or fractional) associated 
with an original node of T' can be used by a step that first contracts the original node to a new 
compound node; after that, none of the credit associated with this original node is available (any 
original node contained in a compound node is essentially “invisible” to the algorithm). 

Informally speaking, <I>(y, Tf) denotes the fractional credit of T(, that is, the part of the fractional 
potential function that is “owned” by T(. The fractional credit of T( will be used together with its 
integral credit for contracting T(. 
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6.5. Second preprocessing step. We apply a second preprocessing step after the A-contraction 
and before the main loop of the algorithm. It turns out that the set of nodes contracted by this step 
is disjoint from the set of nodes contracted by the A-contraction. 

Let bo be a bud, let 6162 = buddylk{bo), and let s denote the stem such that bo is a leaf of T^; 
w.l.o.g. let 61 be the other leaf of Tg. Moreover, if bi is also a bud, then we assume w.l.o.g. that 
up{bo) is an ancestor of up{bi). 

Preprocessing step 2: For each bud bo and the two ends 61 , 62 of buddylk{bo) such that all three 
of bo,bi,b 2 are M-exposed, we contract the two links up{bo)bo and 6162 . 

Fact 6 . 8 . Preprocessing step 2 has sufficient credits. 

Proof. Consider one “triple” consisting of a bud 60 and the two ends 61,62 of buddylk(bo} such that 
^ 0 ) bi, 62 are M-exposed. Note that each of the M-exposed leaves bo, bi, 62 has 1 unit of credit. Thus, 
we have sufficient credit for contracting two links and forming one compound node. □ 

Remark 6.9. The results in this section show that Preprocessing step 2 is valid, in the sense that 
the algorithm has sufficient credits to pay for the cost of this step. Moreover, Preprocessing step 2 
is essential for the overall analysis (i.e., the algorithm cannot skip this step), because the proof of 
Lemma 8.3 relies on this step. 

7. Algorithm and credits II: Overall algorithm 

We present pseudo-code for the overall algorithm, after discussing some preliminaries. Also, we 
state and prove several assertions, i.e., we prove some basic properties maintained by the algorithm. 
These assertions are critical for the analysis in the next section. 

7.1. (Up-to-5) greedy contractions and assertions on M. Recall that the integral credit of a 
set of links J is given by the credits of the M-links pq such that V{Pp g) C IJtjuieJ y{PL,w): plus the 
sum of the (integral) credits of the nodes in Uuuie J ^(^u w)^ plus one if r occurs as an original node 
lu Untog J Pu,w 

We dehne an (up-to-5) greedy contraction to be a contraction of a set of links J such that 

(i) Ul<5; 

(ii) the contraction of J results in a single compound node, i.e., Unn^gj-^nto forms a connected 
graph; 

(iii) the integral credit of J is > | J| -|- 1. 

Clearly, if an (up-to-5) greedy contraction is applicable, then, in polynomial time, the algorithm 
can find a set of links J that is eligible for this step (by examining all link sets of size < 5). 

Remark 7.1. Clearly, the algorithm has sufficient credit for every application of (up-to-5) greedy 
contraction. The overall analysis relies on repeatedly applying (up-to-5) greedy contractions until 
no such contractions are applicable. In fact, every lemma/theorem in Section 8 assumes that no 
(up-to-5) greedy contractions are applicable. 

The following assertion is similar to Lemma 6.3 of Part 1. 

Lemma 7.2 (Assertions on M). Suppose that no (up-to-5) greedy contractions are applicable. Then: 

(1) For every M-link uw, every node in is an original node. In particular, w.r.t. T', both 
ends of each M-link are original leaf nodes. 

(2) There exist no links between M-exposed leaves ofT'. 
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7.2. Good semiclosed trees. Let be a (rooted) subtree of T’. Let U{T^) denote the set of 
M-exposed leaves of (including both original nodes and compound nodes). Let C'(T^) denote the 
set of compound non-leaf nodes of T^. Recall that M(T^) denotes the set of M-links that have both 
ends in T^. Note that every node in {V{T^) n TZ) y]iy{T'^) n S) is an original node. 

Recall that a semiclosed tree means a tree that is semiclosed w.r.t. the matching M, unless 
mentioned otherwise. After the preprocessing steps, whenever we mention semiclosed trees, we 
assume that no (up-to-5) greedy contractions are applicable (see Section 7.1). Then, Lemma 7.2(1) 
implies that M is a set of leaf-to-leaf links w.r.t. the current tree T'. Hence, semiclosed trees (of T' 
w.r.t. M{T')) are well defined. 

We define the credit of a (rooted) subtree T(, of the current tree T' to be the sum of the fractional 
credit of T^, namely, <I>(y,T(|), and the integral credit of T^. The latter is given by the sum of the 
following terms; ||M(T(|)|, the number of compound nodes in T^, the number of M-exposed original 
leaves in and an additional one if the root r is in V{T^) n TZ. We call a semiclosed tree T)) good 
if its credit is > \T{M,T^)\ + 1. 

The next result can be proved using arguments similar to the arguments used for proving Lemma 6.4 
of Part I (although the potential function and credits in Parts I and II are different). 

Lemma 7.3. Let be a semiclosed tree. If at least one of the following conditions is satisfied, then 
is good. 

» T' =T' 

• c\Tf) / 0 

• |M(T')| >2 

• HyX)>^ 

. |M(r')| = l and<^{yX)>\- 

7.3. Summary of the algorithm. We start with F := tj) [F is the set of links picked by the 
algorithm) and T' := T {T' is the current tree T/F). 

apply Preprocessing step 1 (A-contraction); 

apply Preprocessing step 2; 

while T' is not a single node do 

repeatedly apply (up-to-5) greedy contractions until no such contractions are applicable; 
find a good semiclosed tree with a fitting cover J of size |r(M, T(,)| (Algorithm 2 in 
Section 8 presents the details for finding such a semiclosed tree); 
add J to F, contract to a new compound node, update T'; 

end 

Algorithm 1: Find an approximately optimal solution for TAP. 


7.4. Stem assertion of the algorithm. This section presents a basic assertion that is important 
for our analysis. 

Recall that the algorithm iteratively contracts a set of links such that the tree-paths associated 
with these links form a connected graph; the set of chosen links and their associated tree-paths are 
contracted to form a new compound node. When we say that a contraction hits a node v of T', we 
mean that a step of the execution contracts a set of links J such that v is in Umiosj yiKX thus, 
at least one of the tree-edges incident with v is covered by one of the links in J. 

Stem assertion: Let s be a stem. The first contraction that hits a node ofTg (during the execution 
of the algorithm) must hit s. 

Property 7.4. The algorithm maintains the stem assertion. 
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Proof. Let s be a stem. First, consider the two preprocessing steps. Either all of the tree Tg is 
contracted or none of the tree-edges of Tg is contracted by Preprocessing step 1 (A-contraction). 
The same statement holds for Preprocessing step 2. Hence, the stem assertion is maintained by the 
two preprocessing steps. 

Next, suppose that the first contraction (in the execution) that hits a node in Tg is an (up-to- 
5) greedy contraction that contracts a set of links J. If one of the links uw £ J has one end 
in Tg and the other end not in Tg, then the tree path (in the current tree w.r.t. the first 

contraction that hits a node of Tg) must contain s, hence, the stem assertion is maintained. The 
remaining possibility is that all links of J have both ends in Tg. Then observe that the number of 
integral credits available in Tg is either zero, one, or two, and, in the last case, both leaves of Tg are 
M-exposed. (By definition of stem, the root r is not in Tg, since r 7 ^ s.) The greedy contraction of 
J requires | J| + 1 integral credits. Thus, | J| = 1 and J contains the twin link of s. Contracting the 
twin link clearly maintains the stem assertion. 

Finally, suppose that the first contraction (in the execution) that hits a node in Tg is the con¬ 
traction of a good semiclosed tree T)). By Property 2.1, one of Tg, T^ is contained in the other. It is 
easily seen that no proper subtree of Tg is a semiclosed tree. (Note that there is a sublink between 
each leaf of Tg and s via the twin link of s, hence, any semiclosed tree containing a leaf of Tg that 
is M-exposed will contain s too. Also, any semiclosed tree containing a leaf of Tg that is M-covered 
will contain s too.) The only remaining possibility is that T^ contains Tg] then, the contraction of 
Ty maintains the stem assertion. □ 


8. Analysis of the algorithm and deficient trees 

This section has our main results. Informally speaking, the key result (Theorem 8.7) asserts the 
following: if a semiclosed tree T^ is not good, then either T^ is a deficient tree (defined below) or T^ 
is a particular type of tree that is easily bypassed by our analysis. 

The analysis consists of two parts. In Section 8.2, using local integrality of feasible solutions to the 
Lasserre system, we show that all semiclosed trees are good, except for a few cases. The nontrivial 
cases give deficient trees. Section 8.3 shows how to handle deficient trees. This leads to an efficient 
algorithm for finding a good semiclosed tree together with a fitting cover of appropriate size. 
Deficient 3-leaf tree: Suppose that is a semiclosed tree with exactly three leaves 0 , 61 , 62 . 
Clearly, among the nodes w of T(, either there is exactly one node with deg 2 ’/(rc) = 4 or there are 
two nodes with degree 3 in Tk In the latter case, we denote these two nodes by u and q] moreover, 
we fix the notation such that u is an ancestor of q, and the leaf 61 is not a descendant of q] thus, 
0,62 (but not 61 ) are descendants of q. In the former case, we denote by u the unique node that 
is incident to four tree-edges. We call T^ a deficient 3-leaf tree (see Figure 6 ) if (i) the link 6162 is 
present and it is in M[T'), (ii) the link a 6 i is present, and (iii) there exists a link b 2 W such that 
w£V{T')-V{T:,). 

Moreover, in the first case (with a unique node u in T^ with deg 2 ’/(u) =4), if conditions (i)-(iii) 
hold with both labelings ( 61 , 62 ) and ( 62 , 61 ) of the M-link, then we fix the notation such that up{b 2 ) 
is an ancestor of up( 6 i). For both cases, we call 62 the ceiling leaf of Tfi 

Deficient 4-leaf tree: Let T^ be a semi-closed tree with 4 leaves and exactly one stem node s 
such that all nodes in Tg are original, \M{T^)\ = 1, and the link in M{Tfi) has exactly one end in Tfi 
Let the four leaves of be a, 61 , 62 , c, let the two leaves of Tg be a, 61 , and let the link in MfiTfi) be 
6162 . Note that c is the M-exposed leaf of T^ that is not in Tfi Let p be the least common ancestor 
of s and c. Then, T^ is called a deficient A-leaf tree (see Figure 7) if (i) contains all leaves of T^, 
(ii) the link cs is present, and (iii) there exists a link b 2 W such that w G V{T')—V{Tf). We call the 
link cs the latch of Tfi 
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Figure 6 . Illustration of deficient 3-leaf tree. The dashed lines indicate links and 
the thick dashed lines indicate M-links. 


The contraction of the latch cs in a deficient 4-leaf tree results in a deficient 3-leaf tree due to the 
presence of the links 061 , 11 ^( 62)62 (see Figure 6 (a)). Let 6 be the ceiling leaf of the resulting tree. 
Clearly, up(b) is an ancestor of up{b 2 )- 




Figure 7. Illustration of deficient 4- leaf trees. The dashed lines indicate links and 
the thick dashed lines indicate M-links. 


8.1. Properties from assertions. We start with an observation on compound leaf nodes. The 
next result (Lemma 8.2) states some properties pertaining to stem nodes and semiclosed trees; these 
properties are often used in the analysis of the algorithm. The proof relies on the stem assertion. 
After that we present a key lemma that pertains to some buds (Lemma 8.3); see the discussion 
preceding that lemma. 

Fact 8.1. If a compound leaf node contains a node u G y{T), then that compound node contains 
Tu. 

Lemma 8.2. Suppose that no (up-to-b) greedy contractions are applicable. Let T'^ he a semiclosed 
tree with C{Ty) = 0. 

(1) IfT^ contains a stem node s, i.e., s G S r\ V{T^), then every node in is original. 

(2) In the original tree T, suppose that s is a stem node, and w is a leaf ofTg. If w is contained 
in a compound node (c) that is an M-exposed leaf ofT^, then all nodes of Tg are contained 
in (c). 

(3) In the original tree T, suppose that s is a stem node, and w is a leaf of Tg. Ifw is an original 
M-exposed leaf ofT^,, then s is an original node ofT^. 
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Proof. The first part follows from the stem assertion. 

Consider the second part. By the stem assertion, and the fact that w is contained in (c), it can 
be seen that s is contained in some compound node. If s is contained in (c), then the proof is done 
by Fact 8.1. Now, suppose that s is contained in a different compound node (a). Then, there exists 
a link between (c) and (a) (w.r.t. T'), because the link ws is present in E (the input) since ws 
is a sublink of twmlk{s). It can be seen that contains (a) as a leaf, because is semiclosed, 
C{T[f) = 0, (c) is an M-exposed leaf of T(|, and the link between (c) and (a) is present. This gives a 
contradiction because the compound leaf node (a) contains s so it contains Tg (by Fact 8.1), hence, 
(c) cannot contain w. 

The third part follows from arguments similar to that used for the second part; we give a sketch. 
Suppose that s is contained in a compound node (a). If (a) is a leaf of T', then (a) would contain 
the subtree Tg (by Fact 8.1), and this would contradict the fact that the leaf w is an original node. 
Since C{Tf) = 0, (a) cannot be a non-leaf node of T'^. Thus, T!^ contains w, but it does not contain 
(a). This contradicts the fact that is semiclosed, because there is a link between the M-exposed 
leaf w of and (a) (due to the sublink ws of twinlkfs)). □ 

The next lemma addresses a subtle point in our analysis. Consider a semiclosed tree and 
its fractional credit ^{y,Tl), and observe that the links between a bud b in and a node in 
V{T^) n7^®P®“^^(6) do not contribute to the fractional credit; see the third term in the definition of 
(the last displayed equation in Section 6.4). Nevertheless, under appropriate conditions, 
we can exhibit fractional credits even when such links are present (see Lemma 8.5 and its proof). 
In order to do this, we have to focus on buds contained in M-exposed compound leaves such that 
there exists a link between the bud and a node “outside” the compound leaf (this is one part of 
the analysis that examines an original node even after it has been contracted into a compound 
node). The next lemma gives a tool for “counting” such buds, and the lemma is used twice in the 
subsequent analysis. The first use is in the proof of Lemma 8.4 to show that a semiclosed tree 
with \M(T^)\ < 1 has 0(1) such buds, and the second use is in the proof of Lemma 8.6 to show that 
a semiclosed tree with M{T^) = 0 has no such buds. 

Lemma 8.3. Suppose that no (up-to-5) greedy contractions are applicable. Let T(, be a semielosed 
tree with C{Tf) = 0. Let bo be a bud that is contained in an M-exposed compound leaf (c) of 
such that there exists a link £ ineident to bo that has exactly one end (namely, bo) in (c). Let 
huddylk{bo) = 6162 ? and let bibo be the twin link (w.r.t. T) incident to bo. Then, 62 is an original 
leaf node ofT( and 62 is incident to a link of MiT(). 

Proof. Let the original ends of £ be bo, w; thus, w is not contained in (c). Let s be the stem associated 
with 60 ; note that the leaves of Tg are bo,bi. By Lemma 8.2(2), Tg is contained in (c). Since bo is 
contained in an M-exposed leaf (c) of T(, C{T() = 0, and T( is semiclosed, either up{bo) is contained 
in (c), or up{bo) is an original non-leaf node of T(. Clearly, (c) cannot contain up( 6 o), otherwise, (c) 
would contain (by Fact 8.1), hence, the link bow would be contained in (c). Thus, up{bo) is 

an original non-leaf node of T(. 

Consider the node 62 - Suppose that 62 is an original leaf node of T'. Then, 62 is in T( (since T( is 
semiclosed), and 62 is M-covered (by Lemma 7.2(2) and the existence of the link 6162 )- Moreover, 
both ends of the M-link incident to 62 are in T( (since T( is semiclosed). Thus, the lemma holds if 
62 is an original leaf node of T'. 

Now, we may assume that 62 is contained in a compound node (a). The hypotheses of the 
lemma imply that (a) is a leaf in T(. We must have (a) = (c). Otherwise, either (a) is M-covered, 
contradicting Lemma 7.2(1), or (a) is M-exposed, contradicting Lemma 7.2(2) because the link 6162 
implies a link between the M-exposed leaves (a), (c). We conclude that (c) contains all three of 
bo, bi, 62 - In what follows, we use this to derive a contradiction to the fact that up{bo) is an original 
non-leaf node in T(. 
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By Preprocessing step 2 , one of bo, bi, 62 is M-covered w.r.t. T, otherwise, the preprocessing step 
contracts two links to form a compound node that contains up{bo). We have two cases, depending 
on the M-links incident to 60,61,62 w.r.t. T. 

Case 1; There exists an M-link w.r.t. T that has one end 6 G {60,61,62} and the other end u G 
L—{60, 61, 62}. If (c) does not contain u, then we get a contradiction to Lemma 7 . 2 ( 1 ) (M- 
link incident to compound node). Otherwise, (c) contains u, hence, (c) contains Pb,u'-, note 
that up(6o) is in Pb^u-, so (c) contains up{bo). This is a contradiction. 

Case 2 : Each M-link w.r.t. T incident to one of bo, 61,62 has both ends in {60, 61,62}. There is only 
one such M-link, namely, 6062, since the twin link 6061 and the buddy link 6162 cannot be 
in M. Then bo is an M-covered bud w.r.t. T (see Figure 8). 



Figure 8. Illustration of an M-covered bud w.r.t. T. 

We may assume that 6062 G M and 61 is M-exposed. Suppose that up(6i) is a descendent 
of up(6o); then, 61 is a bud; this follows from the definition of bud (see Section 2 ). Then, we 
get a contradiction since 6062 G M and 6062 = buddylk(bi) (M has no buddy links). Hence, 
up(bi) is a proper ancestor of up(bo)- Thus, up(bo) / r, and no (rooted) subtree of Tup(bo) 
is semiclosed since 61 is M-exposed. 

The following claim gives a contradiction and completes the proof of this lemma. 

Claim. Suppose that 6062 is an M-link and 61 is M-exposed. Then, the first 
contraction that hits a node of Tup(feo) (during the execution of the algorithm) 
must hit up(6o). 

First, consider the two preprocessing steps. Preprocessing step 1 contracts all bad 2 -stem 
trees. Note that no leaf of a bad 2 -stem tree is a bud, hence, by Property 2 . 1 , Tup(bo) is 
disjoint from every bad 2 -stem tree. Preprocessing step 2 applies when a bud 6 and the two 
ends of buddylk(b) are all M-exposed. Since bo is M-covered, Preprocessing step 2 does not 
apply to 60,61,62. Thus, the contractions in the preprocessing steps do not hit any node of 

'^up(bo) 

Next, suppose that the first contraction (in the execution) that hits a node in Tup(bo) is 
an (up-to- 5 ) greedy contraction that contracts a set of links J. If one of the links uw G J 
has one end in Tup(6o) and the other end not in then the contraction must hit up(6o), 

because the tree path P^^ (in the current tree at that contraction) contains up(6o). Thus, 
the claim holds. The remaining possibility is that all links of J have both ends in 
But observe that the number of integral credits available in Tup(f,Q) (in the current tree at 
that contraction) is exactly | (due to one M-link and one M-exposed node), and this does 
not suffice for contracting two or more links; moreover, no single-link contraction applies 
because no singleton set J has integral credit > |. 
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Finally, suppose that the first contraction (in the execution) that hits a node in is 

the contraction of a good semiclosed tree T)),. By Property 2.1, one of T^p(bo)i is contained 
in the other. Since no (rooted) subtree of Tj^p(bo) is semiclosed, it follows that contains 
Tup{bo)- Then, the contraction of T(), hits up( 6 o). This proves the claim, and completes the 
proof of the lemma. 

□ 

8.2. Most semiclosed trees are good. Let T(, be a semiclosed tree. Let denote the 

set of M-covered leaves of T^. 

We call a compound node (c) of open if it is an M-exposed leaf of T^, and moreover, (c) 
contains a bud bo such that there exists an original link between bo and an original node that is 
not contained in (c). We call such a bud an open bud (it must be contained in an open compound 
node and one of the original links incident to the bud is not contained in the compound node). Let 
Qcomp(rpr^ denote the set of open buds of T(,. Let B°"^(T)() denote the set of original nodes of 
that are M-exposed buds. Let B(T^) = B°"^?(T^) U B™“P(r^). 

Lemma 8.4. Suppose that no (up-to-b) greedy contraetions are applicable. Let be a semiclosed 
tree with C{T^) = 0, \M{Tl)\ < 1 and \S n V{T^)\ < 2. 

Then, y can be written as a convex combination such that x* G LaS 3 (LPo) and x*| j is 

integral, where J = {i £ dE{u) : u G (S D V{T^)) U ^ 

Proof. We claim that |ones(x) H J| <14 for any feasible solution x of (LPq). The conclusion (of the 
lemma) follows easily from Theorem 4.1 and this claim. 

To prove the claim, observe that |5 n V{T^)\ < 2 and |L™®*^^®'^(r^)| = 2\M{T^)\ < 2, hence, by 
Lemmas 4.3, 4.5, we have 

|ones(x) n {£ £ Se{u) : u £ S D V{T^)}\ < 6 , |ones(x) r\{£ £ 5 e{u) : u £ < 2; 

moreover, for each u £ we have |ones(x) H Se{u)\ < 1. Thus, to complete the proof of the 

claim, we have to show that |P(r^)| < 6 . 

First, consider any bud bo £ P°"^(r^) and its associated stem s. By Lemma 8.2(3), s is an original 
node of T^, and so, s G 5 n V{T^). By Lemma 8.2(1), each leaf of is an original node in T'. Let 
bi be the leaf of other than bo. By Lemma 7.2(2), bi is M-covered (otherwise, the twin link bobi 
connects two M-exposed leaves). Hence, bi ^ B°'^^^{Tf). It follows that each stem in 5 n V(T^) has 
at most one leaf in P°"^?(T^). Since |5 n V{Tl)\ < 2 , we have |H°"^(r);)| < 2 . 

Now, consider one of the buds bo £ P‘^°™P(r^); bo is contained in some open compound node and 
one of the original links incident to bo has its other end “outside” this compound node. Then, by 
Lemma 8.3, the buddy link buddylk{bo) shares an end with an M-link in M{T^). Since |M(r)()| < 1 
and each end of an M-link can be incident to < 2 buddy links, we have |P‘^°™^’(r^)| < 4. Therefore, 
|P(r^)| = |H°"^(r^)| + |H‘^°“P(r^)| < 6. This proves the claim. □ 

Lemma 8.5. Suppose that no (up-to-b) greedy contractions are applicable. Let T'^ be a semiclosed 
tree with C{Tf) = 0. Let J he a set of links that each have at least one end in and no end in 
l,matched(^j,r^ U (5 Pi V(T^)). Let X be a feasible solution o/LaS 3 (LPo) such that x is integral on the 
links {6 e{u) : u £ B{Tf)}. Then, 

Hx,TI,) > min{^, ^x(J)}. 

Proof. Let g{x, T^) denote ^ E«e 7 e"°"=p'^Jny(Tp xi^E{u)) + ^Y.uen^po<=‘->nv{T') x{E{u, H-L^“‘^(u))). By 
Fact 4.7, we have <k(x,T,() > g{x,Tf) + X^ 5 gy(T')n 5 We will show that either 

each link £ £ J contributes \x{£) to g{x,Tf), thereby ensuring g{x,T^) > ^x{J), or there exists a 
set of links that contribute ^ to g{x,T^). 
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Consider any link £ G J with x{£) > 0. By Lemma 7.2(2), £ cannot have both ends at M-exposed 

leaves. Thus, £ has at least one end in V (T(,) n TZ, since it has no end in L^iatchedfjpr^ U (5 n B(T^)), 

is semiclosed, and C(T^) = 0. It is easily seen that, except for one case, £ contributes > ^x{£) to 
9{x,T^). 

The exceptional case occurs when £ has original ends bo and w, where bo G and w G 

TZspeci^^(bo) n V(T^), see Fact 2.3. Since J has no links incident to it follows that 

bo G 13{T^) because either bo is an M-exposed original leaf of T(|, or bo is contained in an M-exposed 
compound leaf of T(,; in the latter case, bo is an open bud (w.r.t. T^). Then, we have x{£) = 1 
because x is integral on the links incident with nodes in We claim that g{x,T^) > Let e 

denote the tree-edge between w (the end of £ in V(T^) n TZ) and its parent. Let 5‘*‘(e) denote the 
set of links with positive x-value that cover e. Clearly, x((5‘*'(e)) > 1. Consider any link £q = pq 
that is in (I‘*‘(e), where g is a descendent of w (possibly, q = w) and p is not in T)),. If (7 is a 

proper descendant of w, then (by the definition of is a descendant of the unique 

child of w (see Figure 3), hence, £ and £q form an overlapping pair such that x{£) + x{£q) > 1; 
this contradicts the overlapping constraints of (LPq). Hence, we have q = w. Clearly, p (the other 
end of £q) cannot belong to L^'^^{q) = L^^'^{w) since p is not a descendant of q. Hence, we have 
g{x,T^) > \x{E{w^V—L^'^^{w))) > |x((5+(e)) > This completes the proof. □ 

Let Ty be a semiclosed tree. We construct an auxiliary graph in order to analyze the credits 
available in T^. We denote the auxiliary graph by AG(T^). This is a bipartite multigraph, and the 
two sets in the node bipartition are denoted by AS{T^) and AU{T^). The first set consists of the 
M-covered leaves stems S n H(T^). The second set contains an auxiliary node 

V (informally speaking, v represents the node set V(T')—V(T')), as well as all the M-exposed leaves 
of T^, thus, AU{Tl) = {7} U C/(r^). 

We define the edge set of AG{T^) as follows: for every link pq w.r.t. T' with p G {S G V(T^)) U 
^matched^^ G U{T^), the edge pq is in AG(T^), and for every link pq w.r.t. T' such that p G 
(5nF(r);))UL“^*^^®‘^(T^), q G V{T')-V{T'^), the edgepu is in AG(r^). Thus, AG(r^) is a multigraph 
(multiple copies of an edge may be present), and every edge in AG(T^) corresponds to a link w.r.t. 
T' (see Figure 9). 



Figure 9. The left figure shows a semiclosed tree where dashed lines indicate 
links and the thick dashed line indicates an M-link. The right figure shows the 
auxiliary graph AG{T^) and its node bipartition {s, b, c}, {u, a, d}. 


In what follows, we may abuse the notation by not distinguishing between edges (sets of edges) 
of AG(r^) and the corresponding links (sets of links) w.r.t. T'. 
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Lemma 8 . 6 . Suppose that no (up-to-b) greedy contraetions are applicable. Let be a semiclosed 
tree such that / T', C{T^) = \M{T^)\ < 1 and \S n V{T;^)\ < 2. 

(1) Suppose that M{T^) = 0. Then for any feasible solution x /or LaS 3 (LPo), we have > 

1. Furthermore, T'^ is good. 

(2) Suppose that |M(T^)| = 1, and \U{T^)\ > |5 n V{T!f)\ + 1. Moreover, suppose that x is a 

feasible solution for LASsfLPn) sueh that x\j is integral and ^(x,T!,) < where J = {£ G 
5e{u) ; w G (5 n V{Tf)) U u BiTf,)}. 

Then, |P(r^)| = |5 n V{Tf)\ + 1. Moreover, the auxiliary graph has a perfect matching 
AM{Tf) such that the following conditions hold: 

(i) x{£) = 1 for each link i G AM{Tf), 

(a) the links of AM{Tf) cover T'^, 

(Hi) for each stem node s G 5 n V{T(), twmlk{s) is in AM{T(), 

(iv) AM{T() has no links of the form vs, where s £ S (1 V{T(). 

Proof. Let e„ denote the tree-edge between v and its parent; is well defined since T( / T'. 
Let J = 5Eiev)U (fJueUiTD^Eiu)). Then, x{J) = x{SEiev)) + J2uGU(Ti) ^i^Eiu)) > 1 + |P(T;^)|; the 
equation holds because (i) T( is semiclosed so none of the links in 5E{ev) is incident to an M-exposed 
leaf of T(, and (ii) by Lemma 7.2(2), no link has both ends at M-exposed leaves; the inequality holds 
because x{5Eie)) > 1 for every tree-edge e. 

Let g{x, T() denote 5 x{5e{u)) + \ x{E{u, 

By Fact 4.7, we have ^{x,T() > g{x,Tf) + X;^ 6 V'(Tpn 5 > g{x,Tf). 

Case (1): M{T() = 0. Clearly, |P(r^)| > 1, since T( has at least one leaf. 

Observe that \x{J) > ^(1 + \U{T()\) > 1 . We will show that every link £ £ J contributes 
> \x{£) to g{x,T(). 

First, we show that 5nP {T() = 0. Otherwise, consider any s £ 5nP {Tf); by Lemma 8.2(1), 
every node in Tg is original, and so the twin link of s has both ends at M-exposed nodes 
(since M{T() = 0); this contradicts Lemma 7.2(2). 

Next, we show that B{T() = 0. Since 5nP {T() = 0, Lemma 8.2(3) implies that B°’'‘^(T() = 
0 (if a bud 60 £ B°‘’^^{T() is present as an original M-exposed leaf of T(, then its stem is 
present as an original node of T(). Also, we have B'^°'^p{T() = 0, otherwise, by Lemma 8.3, 
there exists a link in M{T() (this is impossible since M{T() = 0). Thus, B{T() = 0. 

Now, observe that every link £ £ J has an end in V{T() n IZ, and hence, it contributes 
\x{£) to g{x,T(). Therefore, ^{x,T() > 1. Since this inequality holds for every feasible 
solution X of LaS 3 (LPo), we have ^{y,T() > 1. Hence, by Lemma 7.3, T( is good. 

Case (2): In this case, \M(T()\ = 1 and \U{T()\ > |5nH(r^)| -|-1. Moreover, x\j is integral, where 
J = {££ 5e{u) : u G (5 n V{T()) U U B{Tf)}. 

First, we show that x(5£;(s)) < 1 for each stem s G 5 n V{T(). We use a contradiction 
argument. Suppose that x{5e{s)) > 1. Let (5+(s) denote the set of links of positive x-value 
incident to s. Note that every link £ in (5'’'(s) has x{£) = 1 because x\j is integral. Since 
x{Se{s)) > 1, we have |5'''(s)| > 2. This implies that each of the buddy links associated 
with the stem s has x-value zero. To see this, note that the set of links in (5'*'(s) covering 
each of the three tree-edges incident to s is an overlapping clique, hence, at most one link of 
(5+(s) belongs to one of these overlapping cliques (by the overlapping constraints of (LPq)); 
moreover, each buddy link associated with s is overlapping with the links that belongs to 
two of these overlapping cliques (see Figure 10); hence, if a buddy link £ associated with s 
has x{£) > 0, then we get a violation (of an overlapping constraint of {LPq)) for one of the 
three overlapping cliques. Since the buddy links associated with s (if any) have x-value zero, 
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we have <h(x,T^) > slackx(s) > ^ (^x(dE(s)) — x(twinlk(s))j > and this gives the desired 
contradiction. 



Figure 10. Illustration of a buddy link £ in the proof of Lemma 8 . 6 . Note that £ 
is overlapping with links in both and 6E(e) n Se(s), where e is the tree-edge 

between 63 and s. For example, £ is overlapping with the links sp and sb^. 

For any M-covered leaf w in T^, by Lemma 7.2(1), w is an original node, and moreover, 
we have x{Se{w)) < 1, by Lemma 4.3. Thus, we have x{6e{w)) < 1 for each node w G 
U (cS H V{T^)) = AS(r^). 

Let J = this is the set of links in J but not in AG{T^). We have 

x{J) < 1; otherwise, by Lemma 8.5, we would have 4>(x,T^) > minj^, ^x{J)} > which 
is a contradiction. Then, by the conditions in the hypothesis {\M{T^)\ = 1 and \U(T^)\ > 
|5 n V{Tl)\ + 1) and the fact that |AF(r^)| = \U{T^)\ -|- 1, we have 

1 > x{J)=xiJ) - Y. ^iJriSEiw)) > \AUiT')\ -\ASiT')\ 

w£AS(Ti) 

= |[/(r')| + i-2|M(r')|-|5nF(r')| 

= |[/(r')|-(i + |5nF(r')|)>o. 

Hence, we have \U{T^)\ = |5nF(r(|)| + l, and |AS(T^)| = \AU{T^)\. For each re G AS{T^), 
we have x{J n Se{w)) = 1 because x{£) is integral for each link G J n 5e{w) (otherwise, 
I^wgAS(T') ^ ^e{w)) < |AS(r(()| - 1 implies that x{J) > |AF(r^)| - |AS(T^)| -M > 1). 

For each w G AS(T^), define £w to be the link in Se{vj) n J with x{£w) = 1- We claim 
that these links form a perfect matching of the auxiliary graph, thus, AM(T^) = {ly^ : 
w G AS{T^)}. Otherwise, there exist two links £wi,£w2 (where wi,W2 G AS(T(|)) that are 
incident to the same node u G AU{Tl). Then, x(J) > |AF(r^)| + 1, which implies that 
x{J) = x{J) — J2wgAS{T') ^ ^e{w)) > \AU{T^)\ + 1 — |AS(r(()| > 1. This contradicts the 

fact that x(J) < 1. Our claim follows. 

We claim that AM(T^) covers T^. Otherwise, there exists a tree-edge eo in that is not 
covered by AM(T^). Let 5'''(eo) denote the set of links of positive x-value in 5_B(eo). Then, we 
have x((5’''(eo)) > 1, and none of the links in 5’''(eo) is incident to (50 F(r^)) U 
the latter assertion holds because x{£w) = 1 and x(6e(w)) < 1 for every w G AS(T^), i.e., the 
nodes in AS(T^) are already “saturated” by AM(T^). Thus, Lemma 8.5 applies to 5'''(eo), 
and we have ^(x,T^) > minjl, 2 <^“'"(eo)} > 5 , which is a contradiction. Our claim follows: 
AM(T^) covers T^. 

Additionally, we claim that AM{Tl) has no links between v and S O F(r(|). By way of 
contradiction, assume that there exists a link £ = sq G AM{T^) such that s G 5 O V{T^) 
and q ^ V{T^). Suppose that q is an original non-leaf node. Then sq is a link between two 
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original non-leaf nodes with x{sq) = 1, hence, we have <h(x,T^) > ^ due to Fact 4.7 and the 
following term in <l>(x,r^): 

'^xmv-L) n V{T'), (V-L) n (F(T')-F(T')))). 

This is a contradiction. Otherwise, if g is a compound node or an original leaf, then we 
claim that an (up-to-5) greedy contraction applies, and this too gives a contradiction. Note 
that |5 n V{T^)\ < 2 and |M(T^)| = 1, hence, \AM(T^)\ < 4. The credit assigned to the 
leaves and matching links in is | -|- |17(T^)| = | -|- |5 n V{T^)\ -|- 1 = |AM(T^)| + ^- If 
g is a compound node or an original M-exposed leaf, then g provides one additional credit, 
and this suffices for an (up-to-5) greedy contraction of the links of AM(T^); otherwise, g is 
an M-covered leaf, and the M-link ^q incident to g provides additional | credit, and this 
suffices for an (up-to-5) greedy contraction of ^q together with the links of AM(T^). Our 
claim follows: AM(T^) has no link of the form vs, s £ S Ci V{T^). 

Finally, we prove that twinlk{s) is in AM{T'^) for each stem s G 5 n V{T!^). Consider any 
stem s £ Sr\V{T^). By Lemma 8.2(1), is contained in and every node in Tg is original. 
Let the leaves of Tg be a, bo] thus, twinlk{s) = abo- 

Lemma 7.2(2) implies that one end of the twin link abo is M-covered. Since x\j is integral, 
and J contains abo (each link incident to -g either abo £ AM(T^) or 

x(abo) = 0. We may assume that x(abo) = 0. Suppose that x{buddylk(b)) = 0, 

i.e., there does not exist a buddy link incident to Tg that has positive x-value. Consider the 
link £g in AM(T^) incident with s; clearly, x(£s) = 1. Thus, we have <I>(x,r^) > slackx(s) > 
^x{£s) = 5 . This is a contradiction. 



Figure 11. Illustration of £ and £s in the proof of Lemma 8.6. 

Now, we may assume that there exists a buddy link £ £ |J^g^bud(^){ buddylic( 6 )} such that 

x{£) > 0. W.l.o.g., let the original ends of £ be a and 62 . Let g be the least common ancestor 
of a and 62 w.r.t. T; see Figure 11 and also see Figure 2 in Section 2. Observe that 
contains g (either as an original node or within a compound node), because T(, has at least 
3 leaves, whereas every proper subtree of Tq has < 2 leaves. Thus, contains both ends of 
£. Note that 62 (the end of £ not in T'g) may be an original leaf of or it may be contained 
in a compound leaf of (since C{T^) = 0, 62 cannot be contained in a non-leaf compound 
node of T^). 

Observe that £s 0 (s), otherwise, £s,£ form an overlapping pair (since x(£) > 0, x(£s) = 

1). By similar arguments, £s cannot have both ends in P^s- Thus, £s = sbo (£s has an end 
at an M-exposed leaf of Tg because £s £ 5^{s) and £s £ AM(T^)). Let £q denote the link in 
AM{T'^) that covers the tree-edge between g and its parent (possibly, q = v, and in this case, 
£q is the link incident with v in AM{Ty)). Note that £q has an end at a leaf in Tq. Then, 

29 



^q forms an overlapping pair with either ig or ^ (since x{iq) = 1, x{^s) = 1, x{t) > 0). This 
contradicts the overlapping constraints of (LPq). This completes the proof of the lemma. 

□ 

The following theorem is the key result of this paper. It is used only once; see the last paragraph 
of the proof of Theorem 8 . 8 . Note that the last alternative in Theorem 8.7 (item (3)) is addressed 
in the last paragraph of the proof of Theorem 8 . 8 . 

Theorem 8.7. Suppose that no (up-to-5) greedy contractions are applicable. LetT^ be a semiclosed 
tree that is not good. Then one of the following holds for T^. 

(1) is a deficient 3-leaf tree. 

(2) T(, is a deficient A-leaf tree. 

(3) has 4 leaves with |M(T(,)| = 1, and moreover has no cover of size 3. 

Proof. Since is not good, Lemma 7.3 and Lemma 8.6(1) imply that / T', C'(T^) = 0, and 
|M(r')| = 1. 

Observe that has at least one M-exposed leaf. Otherwise, since |M(T^)| = 1, has exactly 
two leaves and there exists an M-link between these two leaves; moreover, by Lemma 7.2(1), every 
node on the path of T' between these two leaves is original; it follows that the link in M[T!^f) is a 
twin link; this contradicts the definition of M. Thus, has an M-exposed leaf and exactly two 
M-covered leaves. Hence, has at least three leaves. 

Also, observe that |5 O V{T^)\ < 2. Otherwise, suppose that |5 O V(T^)\ > 3. Then, by 
Lemma 8.2(1), for every stem s G 5 O V(T^), every node in is original; hence, there exists a 
stem s* G 5 n V(T^) such that both the leaves of T^. are M-exposed and there exists a twin link 
between these two leaves; this contradicts Lemma 7.2(2). 

Let e denote the tree-edge between v and its parent. Let J denote {I G Se{u) : u G (5nH(T^))U 
U where B{T^) = H°"^(r);) U B^°^P{Ti;) (see the discussion before Lemma 8.4 

for the definitions of H°"^(r^) and B^°^P{Tl,)). 

By Lemma 8.4, y can be written as a convex combination Xliez such that x* G LaS 3 (LPo) 
and x*|j is integral. Since is not good and |M(r^)| = 1, Lemma 7.3 implies that 4>(y,T(,) < 
Thus, there exists an io € Z such that <k(x*°,T^) < 

We claim that \U{T[f)\ < |5nH(r)))|-|-1. To see this, suppose that \U{T!^)\ > |5nH(r^)|-|-l. Then, 
all of the conditions of Lemma 8.6(2) apply, hence, the lemma implies that |[/(r^)| = |5nH(r^)|-|-1. 
Our claim follows. 

Now, we analyze a few cases, depending on the number of leaves of T^. By Lemma 8.2(1), for 
every stem s G 5 n V{Tl^), every node in Tg is original; thus, each stem s G 5 n V{T^) contributes 
two original leaves to T^. Therefore, |5 n V(T^)\ < \L{T^)\/2. 

Case 1; has exactly three leaves. Let the three leaves be 0 , 61 , 62 , where a is M-exposed and 
61,62 are M-covered; thus, 6162 is the unique link in M{T^), and 61 , 62 are original nodes by 
Lemma 7.2(1). Clearly, |5 0 V{T^)\ < 1. We have two subcases, depending on |5 0 V{T[f)\. 
Subcase 1.1: \S n V{T^)\ = 1. Let s be the stem in T^. The M-link 6162 cannot have both 
ends in (since M contains no twin links). Thus, the M-exposed leaf o is a leaf of T^. 
Then, it can be seen that a is a bud and buddylk(a} = 6162 (see the definition of bud 
in Section 2). This is a contradiction, since M contains no buddy links. 

Subcase 1.2: |5 n V{T^)\ = 0. Then we have \U{T^)\ = 1 = 1 + |5 n V{T^)\. By 
Lemma 8.6(2)(i-iv), there exist two links G 6 £;(e) and £a £ dE{a) such that x*“(£^) = 
x^°{ia) = 1, these two links cover T(,, and moreover, each of 61,62 is incident to exactly 
one of these two links (since the auxiliary graph has a perfect matching formed by these 
two links). 
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If there is only one non-leaf node with degree other than 2 in (see Figure 6 (a)), 
then is a deficient 3-leaf tree. We are done. Otherwise, we have exactly two non-leaf 
nodes u, q in with degree other than 2. In fact, both these nodes have degree 3 since 
T(, has exactly 3 leaves. W.l.o.g., let rt be a proper ancestor of q. Then, has only two 
leaves. By the argument at the beginning of the proof, the M-link in T(( cannot connect 
both leaves in T^. This implies that one leaf of is M-exposed. Thus, a is a leaf of 
Tg. W.l.o.g., let the other leaf of be 62 - Then, bi is the third leaf, which is not in T^. 

Suppose that is incident to bi and ia is incident to 62 - Then, the tree-edge be¬ 
tween q and its parent is not covered by these two links (see Figure 12(a)). This is a 
contradiction. Hence, is incident to 62 and ia is incident to bi (see Figure 12(b)). 
Therefore, satisfies all the conditions of a deficient 3-leaf tree. 



Figure 12. The links iv and ia in Subcase 1.2 of the proof of Theorem 8.7. The 
dashed lines indicate links and the thick dashed lines indicate M-links. 

Case 2: has exactly four leaves. Let the four leaves be oi, 02 , 61 , 62 ; where ai, 02 are M-exposed 

and 61,62 are M-covered; thus, 6162 is the unique link in M(T))), and 61,62 are original nodes 
by Lemma 7.2(1). We have 1 = \U{T^)\ — 1 < |5 n V{T^)\ < 2. As above, we have two 
subcases, depending on |5 n V(T^)\. 

Subcase 2.1: |5 n V{T^)\ = 1. Let s be the stem in T^. By Lemma 7.2(2), it can be seen 
that the twin link of s is incident to one M-exposed leaf, say ai, and to one M-covered 
leaf, say 61 (see Subcase 1.1). 

Note that \U(T^)\ = 2 = 1 -|- |5 n V{Tl)\. By Lemma 8.6(2)(i-iv), there exist three 
links aibi (the twin link of s), G 5£;(e) and ia 2 G <^£(< 22 ) such that a;*°(ai 6 i) = 
x^°iiv} = 2 :*° (^ 02 ) = 1) these three links cover T(,, and moreover, each of s, 62 is incident 
to exactly one of the two links iy , ia 2 (since the auxiliary graph has a perfect matching 
formed by the three links); moreover, s cannot be incident to i^ (by Lemma 8.6(2)(iv)). 
Thus iv is incident to 62 , and ia 2 is incident to s, i.e., ia 2 = a 2 S- 

Let p be the least common ancestor of s and 02 (in T^). If does not contain all 
the leaves of , then it can be seen that the tree-edge between p and its parent is not 
covered by the three links aibi,a 2 S,iv (see Figure 13(a)). This is a contradiction (since 
T(, is covered by {aibi,a 2 S,iv})- It follows that Tp contains all the leaves of T(,. Then, 
it can be seen that T(, is a deficient 4-leaf tree (see Figure 13(b)). 

Subcase 2.2: |5 n V{Tl)\ = 2. If has no cover of size 3, then item (3) in the statement 
of the theorem holds. Thus, we may assume that has a cover of size 3. 

Let si,S 2 be the two stems in T^. Let ai,bi be the original leaves of and let 
02 , 62 be the original leaves of T'^^. We may assume that oi, 02 are M-exposed and 61,62 
are M-covered, because each twin link is incident to one M-exposed leaf and to one 
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Case 



Figure 13. The links ia 2 3-nd aibi in subcase 2.1 of the proof of Theorem 8.7. 
The dashed lines indicate links and the thick dashed lines indicate M-links. Another 
case could occur, but it is not shown here (see Figure 7). 


M-covered leaf by Lemma 7.2(2) (see Subcase 2.1). Thus, 6162 is the unique link in 
M(r'). 

By Lemma 8.2(1) and C(T^) = 0, has no compound nodes, i.e., all its nodes are 
original. 

Observe that has at least 3^ credits from the two M-exposed leaves and the 
M-link. 

Consider the possible leaf-to-leaf links of T^. The link 0102 does not exist, by 
Lemma 7.2(2), since 01,02 are both M-exposed. If both the links 0162 and 0261 ex¬ 
ist, then an (up-to-5) greedy contraction applies (we contract these two links and we 
have 3| credits). Thus, at most one of the links 0162 or 0261 is present. 

If has no leafy 3-cover w.r.t. T, then it satishes all the conditions for a bad 2-stem 
tree, hence, it would have been contracted in Preprocessing step 1 (A-contraction). This 
is a contradiction. ^ 

Thus, we may assume that has a leafy 3-cover J = w.r.t. T, where 

io = uw is a link such that one end u is in r„ and the other end w is a leaf in L—L{T^). 
Let £q = uw' be the link w.r.t. T' that corresponds to £ 0 , where w' = w if w is still an 
original node in T', and otherwise, w' is the compound node containing w. Note that 
in either case, w' is not in F(T^) since every node in is original. If w' is a compound 
node or an M-exposed original leaf, then an (up-to-5) greedy contraction applies (we 
contract the 3 links in J and we have 4^ credits). Otherwise, if w' is an M-covered 
original leaf, then again an (up-to-5) greedy contraction applies (we contract 4 links, 
namely, the links in J and the M-link incident to w, and we have 3^ -|- | = 5 credits). 
Thus, the existence of a leafy 3-cover gives a contradiction. 

3: has at least five leaves. Then, has at least three M-exposed leaves and two M-covered 

leaves. Thus, we have 3 < \U{T^)\ < |5 n V{T^)\ -f 1 < 3, since |5 n V{T^)\ < 2; it follows 
that \U(T^)\ = 3, |5 n V(T^)\ = 2, and has exactly 5 leaves. 

Let si,S 2 be the two stems in T^. Let ai,bi be the original leaves of and let 02,62 
be the original leaves of T'^^. We may assume that 01,02 are M-exposed and 61,62 are M- 
covered, because each twin link is incident to one M-exposed leaf and to one M-covered leaf 
by Lemma 7.2(2) (see Subcase 2.1). Thus, 6162 is the unique link in M{T^). Let u be the 
hfth leaf of T^, where u is not in nor in 

Since \U{T^)\ = 3 = 1-f |5n F(r)()|, Lemma 8.6(2)(i-iv) implies that there exist four links 
£v £ £u £ Se{u), 0161 , 0262 (the twin links of si, S 2 ) such that each of si, S 2 is incident 
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to exactly one of the two links , lu (since the auxiliary graph has a perfect matching formed 
by these four links); moreover, cannot be incident to {si, S 2 } (by Lemma 8.6(2)(iv)). This 
is impossible. Thus, cannot have more than four leaves. 

The theorem follows from the above case analysis. □ 

8.3. Addressing deficient trees. This section discusses the missing piece of the algorithm, namely, 
the handling of deficient trees. Even, et ah, see [5, Section 4.3], presented an elegant method for 
addressing deficient 3-leaf trees. We use essentially the same method in Part I (Section 7.2). The 
key point is to compute another (auxiliary) matching of the leaf-to-leaf links of T', and then to find 
a minimally semiclosed tree T(, w.r.t. this auxiliary matching; moreover, we prove (in Theorem 7.4 
of Part I) that T(, is a good semiclosed tree w.r.t. the “original matching,” and T'^ has a fitting cover 
of appropriate size. In this section, we extend this method to address deficient 3-leaf trees as well as 
deficient 4-leaf trees. The extension is straightforward (but non-trivial). We start by “transforming” 
the current tree T' to another tree T ; this is achieved by applying “latch contractions” to the latches 
of deficient 4-leaf trees in an appropriate way (the details are discussed below). Then, we apply the 
method from Part I (Section 7.2) to T. Thus, the main goal in this section is to find a semiclosed 
tree w.r.t. M of the current tree T' such that is good (i.e., it has > |r(M, T^)| -|- 1 credits) 
and has a fitting cover of size |r(M, r^)|. 

For a deficient 3-leaf tree T^, if is not a proper subtree of another deficient 3-leaf tree, then 
we call a maximal deficient 3-leaf tree. Similarly, we define a maximal deficient A-leaf tree. By 
Property 2.1, any two different maximal deficient 3-leaf trees are disjoint; similarly, any two different 
maximal deficient 4-leaf trees are disjoint. 

Let denote the set of latches of all maximal deficient 4-leaf trees of T'. Let T = 

We mention that T is an auxiliary graph that is used for achieving the main goal of this section 
(find a semiclosed tree with desired properties), and other than that, the algorithm does not 
refer to T at all; thus, the algorithm does not add any latch to F while constructing T, nor does it 
incur any credits while constructing T. By a latched-node we mean a node of T that results from 
the contraction of a latch (note that these are not compound nodes of T', and the algorithm does 
not assign any credits to a latched-node). 

Since two different maximal deficient 4-leaf trees are disjoint, each “latch contraction” results in 
a distinct latched-node; thus, there is a bijection between the latches of T' and the latched-nodes of 
T. For a subtree of T, we use latch{Tw) to denote the set consisting of latches that correspond 
to latched-nodes of T^. ^ 

For a node w of T', let w denote the node of T that corresponds to tc. If there exists no latch cs of 
T' such that w is in P'^g (the path of T' between c, s), then w = w, otherwise, w is the latched-node 
formed by contracting the (unique) latch cs such that w is in P'^ ^ (in this case, the latch cs is unique 
because two different maximal deficient 4-leaf trees are disjoint). 

Then, as in Part I (Section 7.2), we construct an auxiliary matching of the leaf-to-leaf links of 
T denoted by For ease of exposition, we denote the image (in T) of M{T') by M{T), and 

we refer to the image (in T) of a link in M{T') as an M-link w.r.t. T (this is justified because any 
M-link w.r.t. T' has no ends in common with the nodes of T' that get contracted into latched- 
nodes). To construct we start with ;= M, then we examine each maximal deficient 

3-leaf tree T^j;, of T and we replace the unique link of M{T^) in by another leaf-to-leaf link. 

In more detail, consider any maximal deficient 3-leaf tree T^, and let the three leaves be a,b,d, 
where a is M-exposed, b is the ceiling leaf, and bd is the unique link in M{Tq,)-, we remove the 
M-link bd from M”®"' and instead we add the link ad to M”®" (see Algorithm 2). Since any two 
different maximal deficient 3-leaf trees are disjoint, this replacement takes place independently for 
each maximal deficient 3-leaf tree. We mention that M°®" is an auxiliary matching that is used for 
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achieving the main goal of this section (find a semiclosed tree with desired properties), and other 
than that, the algorithm does not refer to at all; whereas, the matching M (and its image 

M{T')) are used throughout the algorithm and its analysis. 

Theorem 8.8. Suppose that no (up-to-5) greedy contraetions are applieable. Let T = T'/ , 
and let Ty be a minimally semiclosed tree w.r.t. Then, v cannot be a latehed-node ofT; thus, 

V = V, where v is a node ofT'. Moreover, Ty is a good semielosed tree w.r.t. M and Ty has a fitting 
cover r(M”®'^, fy) U latch{fy) of size \T{M, Tl)\. 

Proof. We start by addressing the first statement. By way of contradiction, suppose that u is a 
latched-node. Observe that every latched-node is contained in a deficient 3-leaf tree of T (a latched- 
node is formed by a “latch contraction” that results in a deficient 3-leaf tree). Hence, Ty is a subtree 
of a maximal deficient 3-leaf tree of T. But, by our construction of M”®”', the ceiling leaf of every 
maximal deficient 3-leaf tree is M”®'^-exposed, hence, no subtree of a maximal deficient 3-leaf tree 
is semiclosed w.r.t. Thus, Ty is not semiclosed w.r.t. M”®"'. This is a contradiction. 

It follows that V cannot be a latched-node. Thus, there exists a node v in T' such that v = v; 
moreover, Ty = Ty, and T^ is well defined. 


Claim 8.9. (1) If Ty has a node in a maximal deficient A-leaf tree ofT', then Ty 
properly contains this maximal deficient A-leaf tree. 

(2) IfTy has a node in a maximal deficient 3-leaf tree ofT', then Ty properly eontains 
this maximal deficient 3-leaf tree. 

We address statement (1) of the claim. Suppose that Ty has a node in a maximal deficient 4-leaf 
tree Tfi. By “latch contraction,” T^ is transformed to a deficient 3-leaf tree of T. Thus, Ty has 
a node in a maximal deficient 3-leaf tree Ty, where u is an ancestor of w in T. But, (as discussed 
above) no subtree of Ty is semiclosed w.r.t. M”®”'. Hence, Ty cannot be a subtree of Ty. Then 
Property 2.1 implies that Ty properly contains Ty. Hence, u is a proper ancestor of w in T' , and Ty 
properly contains Tfi. 

Next, we address statement (2) of the claim. Suppose that Ty has a node in a maximal deficient 
3-leaf tree Tfi. If this maximal deficient 3-leaf tree is contained in a maximal deficient 4-leaf tree, 
then we are done by the previous discussion. Hence, by Property 2.1, we may assume that Tfi has 
no node in a maximal deficient 4-leaf tree. Then T{^ contains no latched-node, and = Tyj = T^ is 
a maximal deficient 3-leaf tree of T. Since Ty has a node in Ty,, and no subtree of Ty, is semiclosed 
w.r.t. M”®", Property 2.1 implies that Ty properly contains Ty,. It follows that Ty properly contains 
T'y,. This proves the claim. 

Claim 8.9 implies that Ty is neither a deficient 3-leaf tree nor a deficient 4-leaf tree. 

Next, we address the second statement in the theorem. To show that Ty is semiclosed w.r.t. M, 
we must show that every link incident to an M-exposed leaf of Ty has both ends in Ty, and every 
M-link incident to Ty has both ends in Tfi Let u be an M-exposed leaf of T'y. Suppose that u is 
contained in Tfi, where is either a deficient 3-leaf tree or a deficient 4-leaf tree (w.r.t. T'). Then 
Ty contains Tfi, by Claim 8.9. Since T^y is a semiclosed tree w.r.t. M (by definition of deficient trees), 
all links incident with u have both ends in T^y, hence, also in Tfi Next, suppose that u does not 
belong to any deficient 3-leaf tree or any deficient 4-leaf tree (w.r.t. T'). Such an M-exposed node 
of Ty is an M”®'^-exposed node of T. Since Ty is semiclosed w.r.t. M”®"', it follows that Ty contains 
both ends of each link incident with u = w, hence, Ty contains both ends of each link incident with 
u. Next, consider an M-link uw incident to Ty, where n is a node of Tfi If rt is contained in Tfi, 
where T^y is a deficient 3-leaf tree or a deficient 4-leaf tree, then note that T^y is semiclosed, and 
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moreover, is properly contained in hence, both ends of uw are in T^. Otherwise, uw has no 
end in a deficient 3-leaf tree or in a deficient 4-leaf tree; then, by our construction of we have 

uw G (M and are the same, except “inside” some deficient tree). Since Ty is semiclosed 

w.r.t. it follows that uw has both ends in Ty, hence, uw has both ends in Ty. Thus, Ty is 

semiclosed w.r.t. M. ^ 

By the construction of T and M”®”', Claim 8.9, and the fact that Ty is semiclosed, we have 
\M{Ty)\ = |M”®"(T^)|; moreover, the number of M-exposed nodes of Ty is equal to the sum of 
the number of M”®"-exposed nodes of Ty and |iatch(T„)| (since each latched-node of Ty corresponds 
to an M-exposed leaf of a maximal deficient 4-leaf tree of Ty). Hence, we have \T{M,Ty)\ = 
|r(M“®", T„)| -|- \latch{Ty)\. Clearly, r(M”®'^, T^) is a fitting cover of Ty, because Ty is a minimally 
semiclosed tree w.r.t. M”®"' (by Lemma 6.2). It follows that r(M°®", T^,) U latch(Ty) is a fitting 
cover of of size |r(M,r^)|. 

Finally, we apply Theorem 8.7 to prove that is good. Suppose that T^ is not good. By 
Claim 8.9, T^ is neither a deficient 3-leaf tree nor a deficient 4-leaf tree. Then, Theorem 8.7(3) 
implies that T^ has 4 leaves, \M{T^)\ = 1, and T^ has no cover of size 3. Since |M(r^)| = 1 and 
\U{T^)\ = 2, we have \T{M,T^)\ = 3. The previous paragraph shows that T(, has a fitting cover of 
size |r(M, T(,)| = 3. Thus, we have a contradiction. It follows that T^ is good. This completes the 
proof. □ 

The procedure described above to find a good semiclosed tree is summarized in the following 
pseudocode. 

let denote the set of latches of all maximal deficient 4-leaf trees of T'; 

let f ;= 

start with M”®'^ := M; 

for each maximal deficient S-leaf tree T^ in T do 

let b be the ceiling leaf, a be the M-exposed leaf and db be the M-link in 
update M”®"' by replacing db by da (M”®"' ;= M°®"—{dfe} U {da}); 

end 

find a minimally semiclosed tree Ty w.r.t. M”®"; 

T^ is a good semiclosed tree w.r.t. M with a fitting cover T{M’^^'^,Ty) U latch{Ty) of size 

\T{M,Tfi)\ (by Theorem 8.8); 

return Ty and its fitting cover T{M’^^'^,Ty) U latch{Ty); 

Algorithm 2: Find a good semiclosed tree by addressing deficient trees. 

Algorithm 2 shows how to find a good semiclosed tree Ty together with a fitting cover of size 
|F(M, T{)| for the main loop in Algorithm 1 in Section 7. (The first step of Algorithm 2 applies 
“latch contractions” to construct the auxiliary tree T from T'; “latch contractions” have no effect 
on F or on the credits of the algorithm.) 

In conclusion. Algorithm 1 runs in polynomial time and returns a solution for TAP of size < 
(| -|- e)y{E). This proves Theorem 6.1. 

Although our integrality ratio (and approximation guarantee) is proved relative to a “weaker 
lower bound” than the optimum value (since the feasible region of a relaxation is a superset of the 
convex full of integer solutions), we loose an additive term of e in the approximation guarantee, 
compared to the | approximation guarantee of [10]. But, we can show that our methods achieve an 
approximation guarantee of | relative to the (integral) optimum value. Observe that the algorithm 
is combinatorial and does not refer to the SDP relaxation at all (our analysis does rely on the SDP 
relaxation and properties of its solutions). Thus, we can apply the algorithm and analysis for the 
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level t = |£^| tightening of (LPq), and at this level, the set of (feasible) solutions of LASi(LPo) 
corresponds to the convex hull of integer solutions. Consequently, for level t = \E\, Lemma 4.4 can 
be restated as follows: if y G LASt(LPo), where t = \E\, then y\E{L) is in the matching polytope 
of G{L) = {L,E{L)). Hence, for level t = \E\, the upperbound on the potential function stated in 
Lemma 5.2 can be improved from (| + e)y{E) to ^y{E). In other words, for y G LASt(LPo), where 
t = \E\, the potential function is < |y(P). Sections 6-8 show that Algorithm 1 returns a solution 
for TAP whose size is < the potential function. Thus, we have the following result. 

Corollary 8.10. Lety denote an optimal (integral) solution of TAP. Algorithm 1 runs in polynomial 
time, and returns a solution for TAP of size < ^y{E). 

It can be seen that Algorithm 1 can be implemented to run in time 0(|P|^ + \V\^m^, where mo 
denotes the number of (multi) links in the input and we assume that mo = 0{\V\'^). This can be 
achieved using simple data structures such as adjacency matrices and adjacency lists; the details are 
not straightforward, but we skip them. To handle the (up-to-5) greedy contractions, every subset 
of size < 5 of the set of maximal links has to be examined (for each link uw in a subset, Pu,w has to 
be examined), and there are 0(|P|) such contractions, hence, these contractions contribute a total 
of 0(|PpmQ) to the running time. 

Theorem 8.11. Algorithm 1 can he implemented to run in time 0(|Pp + iPpm^). 
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